Overview

Brought to you by YData

Dataset statistics

Number of variables115
Number of observations18866
Missing cells611074
Missing cells (%)28.2%
Total size in memory16.6 MiB
Average record size in memory920.0 B

Variable types

Text115

Dataset

DescriptionVertebrate Zoology Division - Mammalogy, Yale Peabody Museum 0061684-241126133413365
URLhttps://doi.org/10.15468/dl.shrths

Alerts

accessRights has constant value "Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj" Constant
language has constant value "http://creativecommons.org/publicdomain/zero/1.0/" Constant
license has constant value "CC0_1_0" Constant
rightsHolder has constant value "Yale Peabody Museum" Constant
type has constant value "PhysicalObject" Constant
institutionCode has constant value "YPM" Constant
collectionCode has constant value "VZ" Constant
ownerInstitutionCode has constant value "YPM" Constant
basisOfRecord has constant value "PRESERVED_SPECIMEN" Constant
dataGeneralizations has constant value "Coordinate data unavailable" Constant
occurrenceStatus has constant value "PRESENT" Constant
phylum has constant value "Chordata" Constant
class has constant value "Mammalia" Constant
nomenclaturalCode has constant value "ICZN" Constant
taxonRemarks has constant value "Animals and Plants: Vertebrates - Mammals" Constant
datasetKey has constant value "854f602e-f762-11e1-a439-00145eb45e9a" Constant
publishingCountry has constant value "US" Constant
mediaType has constant value "StillImage" Constant
phylumKey has constant value "44" Constant
classKey has constant value "359" Constant
protocol has constant value "EML" Constant
lastCrawled has constant value "2025-01-08T13:41:11.140Z" Constant
isSequenced has constant value "false" Constant
publishedByGbifRegion has constant value "NORTH_AMERICA" Constant
dataGeneralizations has 18800 (99.7%) missing values Missing
recordedBy has 4296 (22.8%) missing values Missing
sex has 10133 (53.7%) missing values Missing
lifeStage has 17963 (95.2%) missing values Missing
reproductiveCondition has 16576 (87.9%) missing values Missing
behavior has 18864 (> 99.9%) missing values Missing
preparations has 349 (1.8%) missing values Missing
associatedReferences has 12450 (66.0%) missing values Missing
associatedTaxa has 18487 (98.0%) missing values Missing
otherCatalogNumbers has 12652 (67.1%) missing values Missing
fieldNumber has 11555 (61.2%) missing values Missing
eventDate has 6567 (34.8%) missing values Missing
startDayOfYear has 7901 (41.9%) missing values Missing
endDayOfYear has 7901 (41.9%) missing values Missing
year has 6572 (34.8%) missing values Missing
month has 7472 (39.6%) missing values Missing
day has 7989 (42.3%) missing values Missing
habitat has 18739 (99.3%) missing values Missing
higherGeography has 3778 (20.0%) missing values Missing
continent has 3874 (20.5%) missing values Missing
waterBody has 18739 (99.3%) missing values Missing
countryCode has 3974 (21.1%) missing values Missing
stateProvince has 5347 (28.3%) missing values Missing
county has 9192 (48.7%) missing values Missing
municipality has 18309 (97.0%) missing values Missing
locality has 5869 (31.1%) missing values Missing
verbatimElevation has 17391 (92.2%) missing values Missing
decimalLatitude has 5543 (29.4%) missing values Missing
decimalLongitude has 5543 (29.4%) missing values Missing
coordinateUncertaintyInMeters has 5609 (29.7%) missing values Missing
georeferencedBy has 18537 (98.3%) missing values Missing
georeferencedDate has 10549 (55.9%) missing values Missing
georeferenceProtocol has 5610 (29.7%) missing values Missing
georeferenceSources has 5615 (29.8%) missing values Missing
georeferenceRemarks has 5661 (30.0%) missing values Missing
typeStatus has 18844 (99.9%) missing values Missing
identifiedBy has 17735 (94.0%) missing values Missing
dateIdentified has 17913 (94.9%) missing values Missing
identificationRemarks has 18863 (> 99.9%) missing values Missing
order has 406 (2.2%) missing values Missing
family has 684 (3.6%) missing values Missing
genus has 1248 (6.6%) missing values Missing
genericName has 1248 (6.6%) missing values Missing
specificEpithet has 2554 (13.5%) missing values Missing
infraspecificEpithet has 11638 (61.7%) missing values Missing
elevation has 17391 (92.2%) missing values Missing
elevationAccuracy has 18082 (95.8%) missing values Missing
distanceFromCentroidInMeters has 18788 (99.6%) missing values Missing
mediaType has 18411 (97.6%) missing values Missing
orderKey has 406 (2.2%) missing values Missing
familyKey has 684 (3.6%) missing values Missing
genusKey has 1248 (6.6%) missing values Missing
speciesKey has 2554 (13.5%) missing values Missing
species has 2554 (13.5%) missing values Missing
repatriated has 3910 (20.7%) missing values Missing
gbifRegion has 3929 (20.8%) missing values Missing
level0Gid has 5871 (31.1%) missing values Missing
level0Name has 5871 (31.1%) missing values Missing
level1Gid has 5888 (31.2%) missing values Missing
level1Name has 5888 (31.2%) missing values Missing
level2Gid has 5935 (31.5%) missing values Missing
level2Name has 5935 (31.5%) missing values Missing
level3Gid has 16539 (87.7%) missing values Missing
level3Name has 16544 (87.7%) missing values Missing
iucnRedListCategory has 7581 (40.2%) missing values Missing
gbifID has unique values Unique
bibliographicCitation has unique values Unique
references has unique values Unique
dynamicProperties has unique values Unique
occurrenceID has unique values Unique
catalogNumber has unique values Unique

Reproduction

Analysis started2025-01-08 23:32:52.429790
Analysis finished2025-01-08 23:32:54.507993
Duration2.08 seconds
Software versionydata-profiling vv4.12.1
Download configurationconfig.json

Variables

gbifID
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:54.683725image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters188660
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st row4953409301
2nd row4911830319
3rd row4911830318
4th row4911830317
5th row4911830316
ValueCountFrequency (%)
4953409301 1
 
< 0.1%
4599382340 1
 
< 0.1%
4911830315 1
 
< 0.1%
4911830314 1
 
< 0.1%
4911830313 1
 
< 0.1%
4911830312 1
 
< 0.1%
4911830311 1
 
< 0.1%
4911830310 1
 
< 0.1%
4911830309 1
 
< 0.1%
4911830308 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-08T18:32:54.946881image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 188660
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Common 188660
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 188660
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

accessRights
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:55.021396image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length129
Median length129
Mean length129
Min length129

Characters and Unicode

Total characters2433714
Distinct characters38
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
2nd rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
3rd rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
4th rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
5th rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
ValueCountFrequency (%)
open 18866
11.1%
access 18866
11.1%
http://creativecommons.org/publicdomain/zero/1.0 18866
11.1%
see 18866
11.1%
yale 18866
11.1%
peabody 18866
11.1%
policies 18866
11.1%
at 18866
11.1%
http://hdl.handle.net/10079/8931zqj 18866
11.1%
2025-01-08T18:32:55.135467image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 226392
 
9.3%
/ 188660
 
7.8%
150928
 
6.2%
t 132062
 
5.4%
o 132062
 
5.4%
a 113196
 
4.7%
c 113196
 
4.7%
i 94330
 
3.9%
n 94330
 
3.9%
s 94330
 
3.9%
Other values (28) 1094228
45.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1641342
67.4%
Other Punctuation 358454
 
14.7%
Decimal Number 207526
 
8.5%
Space Separator 150928
 
6.2%
Uppercase Letter 75464
 
3.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 226392
13.8%
t 132062
 
8.0%
o 132062
 
8.0%
a 113196
 
6.9%
c 113196
 
6.9%
i 94330
 
5.7%
n 94330
 
5.7%
s 94330
 
5.7%
l 94330
 
5.7%
p 94330
 
5.7%
Other values (12) 452784
27.6%
Decimal Number
ValueCountFrequency (%)
1 56598
27.3%
0 56598
27.3%
9 37732
18.2%
8 18866
 
9.1%
7 18866
 
9.1%
3 18866
 
9.1%
Other Punctuation
ValueCountFrequency (%)
/ 188660
52.6%
. 75464
 
21.1%
: 56598
 
15.8%
; 18866
 
5.3%
, 18866
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
P 18866
25.0%
O 18866
25.0%
Y 18866
25.0%
A 18866
25.0%
Space Separator
ValueCountFrequency (%)
150928
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1716806
70.5%
Common 716908
29.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 226392
13.2%
t 132062
 
7.7%
o 132062
 
7.7%
a 113196
 
6.6%
c 113196
 
6.6%
i 94330
 
5.5%
n 94330
 
5.5%
s 94330
 
5.5%
l 94330
 
5.5%
p 94330
 
5.5%
Other values (16) 528248
30.8%
Common
ValueCountFrequency (%)
/ 188660
26.3%
150928
21.1%
. 75464
 
10.5%
: 56598
 
7.9%
1 56598
 
7.9%
0 56598
 
7.9%
9 37732
 
5.3%
8 18866
 
2.6%
7 18866
 
2.6%
3 18866
 
2.6%
Other values (2) 37732
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2433714
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 226392
 
9.3%
/ 188660
 
7.8%
150928
 
6.2%
t 132062
 
5.4%
o 132062
 
5.4%
a 113196
 
4.7%
c 113196
 
4.7%
i 94330
 
3.9%
n 94330
 
3.9%
s 94330
 
3.9%
Other values (28) 1094228
45.0%
Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:55.318801image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length62
Median length50
Mean length40.04675077
Min length20

Characters and Unicode

Total characters755522
Distinct characters66
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowTamias striatus fisheri (YPM MAM 017903)
2nd rowPeromyscus leucopus noveboracensis (YPM MAM 017889)
3rd rowPeromyscus leucopus noveboracensis (YPM MAM 017897)
4th rowPeromyscus leucopus noveboracensis (YPM MAM 017895)
5th rowPeromyscus leucopus noveboracensis (YPM MAM 017888)
ValueCountFrequency (%)
ypm 18866
 
18.4%
mam 18866
 
18.4%
peromyscus 1837
 
1.8%
cinereus 1489
 
1.5%
sorex 1193
 
1.2%
brevicauda 1125
 
1.1%
blarina 976
 
1.0%
zibethicus 898
 
0.9%
talpoides 868
 
0.8%
gapperi 848
 
0.8%
Other values (20938) 55590
54.2%
2025-01-08T18:32:55.583951image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83690
 
11.1%
M 58523
 
7.7%
0 44332
 
5.9%
s 41623
 
5.5%
i 36625
 
4.8%
a 35093
 
4.6%
u 30890
 
4.1%
e 30381
 
4.0%
r 26522
 
3.5%
o 25267
 
3.3%
Other values (56) 342576
45.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 370969
49.1%
Uppercase Letter 131912
 
17.5%
Decimal Number 126705
 
16.8%
Space Separator 83690
 
11.1%
Close Punctuation 18866
 
2.5%
Open Punctuation 18866
 
2.5%
Other Punctuation 4512
 
0.6%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 41623
11.2%
i 36625
9.9%
a 35093
9.5%
u 30890
 
8.3%
e 30381
 
8.2%
r 26522
 
7.1%
o 25267
 
6.8%
n 22452
 
6.1%
c 20781
 
5.6%
l 16432
 
4.4%
Other values (16) 84903
22.9%
Uppercase Letter
ValueCountFrequency (%)
M 58523
44.4%
P 21973
 
16.7%
A 19464
 
14.8%
Y 18866
 
14.3%
C 2505
 
1.9%
S 1952
 
1.5%
B 1452
 
1.1%
O 1312
 
1.0%
T 1217
 
0.9%
N 831
 
0.6%
Other values (14) 3817
 
2.9%
Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Other Punctuation
ValueCountFrequency (%)
. 4510
> 99.9%
? 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
83690
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18866
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18866
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 502881
66.6%
Common 252641
33.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 58523
 
11.6%
s 41623
 
8.3%
i 36625
 
7.3%
a 35093
 
7.0%
u 30890
 
6.1%
e 30381
 
6.0%
r 26522
 
5.3%
o 25267
 
5.0%
n 22452
 
4.5%
P 21973
 
4.4%
Other values (40) 173532
34.5%
Common
ValueCountFrequency (%)
83690
33.1%
0 44332
17.5%
1 20061
 
7.9%
) 18866
 
7.5%
( 18866
 
7.5%
2 9843
 
3.9%
6 8199
 
3.2%
7 8066
 
3.2%
5 8017
 
3.2%
4 7780
 
3.1%
Other values (6) 24921
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 755522
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
83690
 
11.1%
M 58523
 
7.7%
0 44332
 
5.9%
s 41623
 
5.5%
i 36625
 
4.8%
a 35093
 
4.6%
u 30890
 
4.1%
e 30381
 
4.0%
r 26522
 
3.5%
o 25267
 
3.3%
Other values (56) 342576
45.3%

language
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:55.650965image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length49
Median length49
Mean length49
Min length49

Characters and Unicode

Total characters924434
Distinct characters24
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttp://creativecommons.org/publicdomain/zero/1.0/
2nd rowhttp://creativecommons.org/publicdomain/zero/1.0/
3rd rowhttp://creativecommons.org/publicdomain/zero/1.0/
4th rowhttp://creativecommons.org/publicdomain/zero/1.0/
5th rowhttp://creativecommons.org/publicdomain/zero/1.0/
ValueCountFrequency (%)
http://creativecommons.org/publicdomain/zero/1.0 18866
100.0%
2025-01-08T18:32:55.759533image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 113196
 
12.2%
o 94330
 
10.2%
m 56598
 
6.1%
c 56598
 
6.1%
r 56598
 
6.1%
e 56598
 
6.1%
t 56598
 
6.1%
i 56598
 
6.1%
. 37732
 
4.1%
n 37732
 
4.1%
Other values (14) 301856
32.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 716908
77.6%
Other Punctuation 169794
 
18.4%
Decimal Number 37732
 
4.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 94330
13.2%
m 56598
 
7.9%
c 56598
 
7.9%
r 56598
 
7.9%
e 56598
 
7.9%
t 56598
 
7.9%
i 56598
 
7.9%
n 37732
 
5.3%
a 37732
 
5.3%
p 37732
 
5.3%
Other values (9) 169794
23.7%
Other Punctuation
ValueCountFrequency (%)
/ 113196
66.7%
. 37732
 
22.2%
: 18866
 
11.1%
Decimal Number
ValueCountFrequency (%)
1 18866
50.0%
0 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 716908
77.6%
Common 207526
 
22.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 94330
13.2%
m 56598
 
7.9%
c 56598
 
7.9%
r 56598
 
7.9%
e 56598
 
7.9%
t 56598
 
7.9%
i 56598
 
7.9%
n 37732
 
5.3%
a 37732
 
5.3%
p 37732
 
5.3%
Other values (9) 169794
23.7%
Common
ValueCountFrequency (%)
/ 113196
54.5%
. 37732
 
18.2%
1 18866
 
9.1%
: 18866
 
9.1%
0 18866
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 924434
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 113196
 
12.2%
o 94330
 
10.2%
m 56598
 
6.1%
c 56598
 
6.1%
r 56598
 
6.1%
e 56598
 
6.1%
t 56598
 
6.1%
i 56598
 
6.1%
. 37732
 
4.1%
n 37732
 
4.1%
Other values (14) 301856
32.7%

license
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:55.799569image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters132062
Distinct characters4
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCC0_1_0
2nd rowCC0_1_0
3rd rowCC0_1_0
4th rowCC0_1_0
5th rowCC0_1_0
ValueCountFrequency (%)
cc0_1_0 18866
100.0%
2025-01-08T18:32:55.889324image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 37732
28.6%
0 37732
28.6%
_ 37732
28.6%
1 18866
14.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 56598
42.9%
Uppercase Letter 37732
28.6%
Connector Punctuation 37732
28.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 37732
66.7%
1 18866
33.3%
Uppercase Letter
ValueCountFrequency (%)
C 37732
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 94330
71.4%
Latin 37732
 
28.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 37732
40.0%
_ 37732
40.0%
1 18866
20.0%
Latin
ValueCountFrequency (%)
C 37732
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132062
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 37732
28.6%
0 37732
28.6%
_ 37732
28.6%
1 18866
14.3%
Distinct1200
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:56.036401image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length20
Median length20
Mean length20
Min length20

Characters and Unicode

Total characters377320
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique667 ?
Unique (%)3.5%

Sample

1st row2024-10-14T12:59:55Z
2nd row2024-10-11T19:54:42Z
3rd row2024-10-11T19:54:42Z
4th row2024-10-11T19:54:42Z
5th row2024-10-11T19:54:42Z
ValueCountFrequency (%)
2024-09-17t21:33:28z 3971
21.0%
2024-10-12t17:36:53z 3555
18.8%
2024-09-29t10:06:24z 1799
 
9.5%
2024-09-23t19:57:36z 1572
 
8.3%
2024-02-19t13:33:41z 826
 
4.4%
2024-04-16t21:52:31z 553
 
2.9%
2024-04-28t21:51:52z 236
 
1.3%
2024-10-22t21:33:57z 219
 
1.2%
2023-07-18t22:00:07z 158
 
0.8%
2020-12-23t21:50:47z 157
 
0.8%
Other values (1190) 5820
30.8%
2025-01-08T18:32:56.259329image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 68042
18.0%
0 47839
12.7%
1 42959
11.4%
- 37732
10.0%
: 37732
10.0%
3 29809
7.9%
4 22029
 
5.8%
T 18866
 
5.0%
Z 18866
 
5.0%
9 13778
 
3.7%
Other values (4) 39668
10.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 264124
70.0%
Dash Punctuation 37732
 
10.0%
Other Punctuation 37732
 
10.0%
Uppercase Letter 37732
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 68042
25.8%
0 47839
18.1%
1 42959
16.3%
3 29809
11.3%
4 22029
 
8.3%
9 13778
 
5.2%
6 11557
 
4.4%
5 11299
 
4.3%
7 11126
 
4.2%
8 5686
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%
Other Punctuation
ValueCountFrequency (%)
: 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 339588
90.0%
Latin 37732
 
10.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 68042
20.0%
0 47839
14.1%
1 42959
12.7%
- 37732
11.1%
: 37732
11.1%
3 29809
8.8%
4 22029
 
6.5%
9 13778
 
4.1%
6 11557
 
3.4%
5 11299
 
3.3%
Other values (2) 16812
 
5.0%
Latin
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 377320
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 68042
18.0%
0 47839
12.7%
1 42959
11.4%
- 37732
10.0%
: 37732
10.0%
3 29809
7.9%
4 22029
 
5.8%
T 18866
 
5.0%
Z 18866
 
5.0%
9 13778
 
3.7%
Other values (4) 39668
10.5%

references
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:56.378437image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length68
Median length64
Mean length64.95473338
Min length64

Characters and Unicode

Total characters1225436
Distinct characters35
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017903
2nd rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017889
3rd rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017897
4th rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017895
5th rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017888
ValueCountFrequency (%)
http://collections.peabody.yale.edu/search/record/ypm-mam-017903 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017835 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017891 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017900 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017899 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017902 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017890 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017901 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017896 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017898 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-08T18:32:56.672307image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 113196
 
9.2%
/ 94330
 
7.7%
c 75464
 
6.2%
o 75464
 
6.2%
. 61101
 
5.0%
M 56598
 
4.6%
t 56598
 
4.6%
l 56598
 
4.6%
d 56598
 
4.6%
a 56598
 
4.6%
Other values (25) 522891
42.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 754640
61.6%
Other Punctuation 174297
 
14.2%
Uppercase Letter 132062
 
10.8%
Decimal Number 126705
 
10.3%
Dash Punctuation 37732
 
3.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 113196
15.0%
c 75464
10.0%
o 75464
10.0%
t 56598
 
7.5%
l 56598
 
7.5%
d 56598
 
7.5%
a 56598
 
7.5%
r 37732
 
5.0%
y 37732
 
5.0%
h 37732
 
5.0%
Other values (6) 150928
20.0%
Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
M 56598
42.9%
P 18866
 
14.3%
A 18866
 
14.3%
Y 18866
 
14.3%
R 18866
 
14.3%
Other Punctuation
ValueCountFrequency (%)
/ 94330
54.1%
. 61101
35.1%
: 18866
 
10.8%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 886702
72.4%
Common 338734
 
27.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 113196
12.8%
c 75464
 
8.5%
o 75464
 
8.5%
M 56598
 
6.4%
t 56598
 
6.4%
l 56598
 
6.4%
d 56598
 
6.4%
a 56598
 
6.4%
r 37732
 
4.3%
y 37732
 
4.3%
Other values (11) 264124
29.8%
Common
ValueCountFrequency (%)
/ 94330
27.8%
. 61101
18.0%
0 44332
13.1%
- 37732
 
11.1%
1 20061
 
5.9%
: 18866
 
5.6%
2 9843
 
2.9%
6 8199
 
2.4%
7 8066
 
2.4%
5 8017
 
2.4%
Other values (4) 28187
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1225436
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 113196
 
9.2%
/ 94330
 
7.7%
c 75464
 
6.2%
o 75464
 
6.2%
. 61101
 
5.0%
M 56598
 
4.6%
t 56598
 
4.6%
l 56598
 
4.6%
d 56598
 
4.6%
a 56598
 
4.6%
Other values (25) 522891
42.7%

rightsHolder
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:56.732553image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters358454
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYale Peabody Museum
2nd rowYale Peabody Museum
3rd rowYale Peabody Museum
4th rowYale Peabody Museum
5th rowYale Peabody Museum
ValueCountFrequency (%)
yale 18866
33.3%
peabody 18866
33.3%
museum 18866
33.3%
2025-01-08T18:32:56.833544image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 56598
15.8%
a 37732
10.5%
37732
10.5%
u 37732
10.5%
Y 18866
 
5.3%
l 18866
 
5.3%
P 18866
 
5.3%
b 18866
 
5.3%
o 18866
 
5.3%
d 18866
 
5.3%
Other values (4) 75464
21.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 264124
73.7%
Uppercase Letter 56598
 
15.8%
Space Separator 37732
 
10.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 56598
21.4%
a 37732
14.3%
u 37732
14.3%
l 18866
 
7.1%
b 18866
 
7.1%
o 18866
 
7.1%
d 18866
 
7.1%
y 18866
 
7.1%
s 18866
 
7.1%
m 18866
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%
Space Separator
ValueCountFrequency (%)
37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 320722
89.5%
Common 37732
 
10.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 56598
17.6%
a 37732
11.8%
u 37732
11.8%
Y 18866
 
5.9%
l 18866
 
5.9%
P 18866
 
5.9%
b 18866
 
5.9%
o 18866
 
5.9%
d 18866
 
5.9%
y 18866
 
5.9%
Other values (3) 56598
17.6%
Common
ValueCountFrequency (%)
37732
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 358454
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 56598
15.8%
a 37732
10.5%
37732
10.5%
u 37732
10.5%
Y 18866
 
5.3%
l 18866
 
5.3%
P 18866
 
5.3%
b 18866
 
5.3%
o 18866
 
5.3%
d 18866
 
5.3%
Other values (4) 75464
21.1%

type
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:56.881225image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters264124
Distinct characters13
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPhysicalObject
2nd rowPhysicalObject
3rd rowPhysicalObject
4th rowPhysicalObject
5th rowPhysicalObject
ValueCountFrequency (%)
physicalobject 18866
100.0%
2025-01-08T18:32:56.984427image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
c 37732
14.3%
P 18866
 
7.1%
h 18866
 
7.1%
y 18866
 
7.1%
s 18866
 
7.1%
i 18866
 
7.1%
a 18866
 
7.1%
l 18866
 
7.1%
O 18866
 
7.1%
b 18866
 
7.1%
Other values (3) 56598
21.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 226392
85.7%
Uppercase Letter 37732
 
14.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
c 37732
16.7%
h 18866
8.3%
y 18866
8.3%
s 18866
8.3%
i 18866
8.3%
a 18866
8.3%
l 18866
8.3%
b 18866
8.3%
j 18866
8.3%
e 18866
8.3%
Uppercase Letter
ValueCountFrequency (%)
P 18866
50.0%
O 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 264124
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
c 37732
14.3%
P 18866
 
7.1%
h 18866
 
7.1%
y 18866
 
7.1%
s 18866
 
7.1%
i 18866
 
7.1%
a 18866
 
7.1%
l 18866
 
7.1%
O 18866
 
7.1%
b 18866
 
7.1%
Other values (3) 56598
21.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 264124
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
c 37732
14.3%
P 18866
 
7.1%
h 18866
 
7.1%
y 18866
 
7.1%
s 18866
 
7.1%
i 18866
 
7.1%
a 18866
 
7.1%
l 18866
 
7.1%
O 18866
 
7.1%
b 18866
 
7.1%
Other values (3) 56598
21.4%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.026036image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters18866
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%
2025-01-08T18:32:57.124821image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18866
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring scripts

ValueCountFrequency (%)
Common 18866
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18866
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

institutionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.164360image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56598
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYPM
2nd rowYPM
3rd rowYPM
4th rowYPM
5th rowYPM
ValueCountFrequency (%)
ypm 18866
100.0%
2025-01-08T18:32:57.260244image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 56598
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56598
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56598
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

collectionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.302245image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters37732
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVZ
2nd rowVZ
3rd rowVZ
4th rowVZ
5th rowVZ
ValueCountFrequency (%)
vz 18866
100.0%
2025-01-08T18:32:57.394639image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 37732
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 37732
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37732
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

ownerInstitutionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.434029image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56598
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYPM
2nd rowYPM
3rd rowYPM
4th rowYPM
5th rowYPM
ValueCountFrequency (%)
ypm 18866
100.0%
2025-01-08T18:32:57.525422image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 56598
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56598
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56598
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

basisOfRecord
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.572816image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length18
Median length18
Mean length18
Min length18

Characters and Unicode

Total characters339588
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRESERVED_SPECIMEN
2nd rowPRESERVED_SPECIMEN
3rd rowPRESERVED_SPECIMEN
4th rowPRESERVED_SPECIMEN
5th rowPRESERVED_SPECIMEN
ValueCountFrequency (%)
preserved_specimen 18866
100.0%
2025-01-08T18:32:57.672643image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 94330
27.8%
P 37732
 
11.1%
R 37732
 
11.1%
S 37732
 
11.1%
V 18866
 
5.6%
D 18866
 
5.6%
_ 18866
 
5.6%
C 18866
 
5.6%
I 18866
 
5.6%
M 18866
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 320722
94.4%
Connector Punctuation 18866
 
5.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 94330
29.4%
P 37732
 
11.8%
R 37732
 
11.8%
S 37732
 
11.8%
V 18866
 
5.9%
D 18866
 
5.9%
C 18866
 
5.9%
I 18866
 
5.9%
M 18866
 
5.9%
N 18866
 
5.9%
Connector Punctuation
ValueCountFrequency (%)
_ 18866
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 320722
94.4%
Common 18866
 
5.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 94330
29.4%
P 37732
 
11.8%
R 37732
 
11.8%
S 37732
 
11.8%
V 18866
 
5.9%
D 18866
 
5.9%
C 18866
 
5.9%
I 18866
 
5.9%
M 18866
 
5.9%
N 18866
 
5.9%
Common
ValueCountFrequency (%)
_ 18866
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 339588
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 94330
27.8%
P 37732
 
11.1%
R 37732
 
11.1%
S 37732
 
11.1%
V 18866
 
5.6%
D 18866
 
5.6%
_ 18866
 
5.6%
C 18866
 
5.6%
I 18866
 
5.6%
M 18866
 
5.6%

dataGeneralizations
Text

Constant  Missing 

Distinct1
Distinct (%)1.5%
Missing18800
Missing (%)99.7%
Memory size147.5 KiB
2025-01-08T18:32:57.718625image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length27
Median length27
Mean length27
Min length27

Characters and Unicode

Total characters1782
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCoordinate data unavailable
2nd rowCoordinate data unavailable
3rd rowCoordinate data unavailable
4th rowCoordinate data unavailable
5th rowCoordinate data unavailable
ValueCountFrequency (%)
coordinate 66
33.3%
data 66
33.3%
unavailable 66
33.3%
2025-01-08T18:32:57.817755image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 396
22.2%
o 132
 
7.4%
d 132
 
7.4%
i 132
 
7.4%
n 132
 
7.4%
t 132
 
7.4%
e 132
 
7.4%
132
 
7.4%
l 132
 
7.4%
C 66
 
3.7%
Other values (4) 264
14.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1584
88.9%
Space Separator 132
 
7.4%
Uppercase Letter 66
 
3.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 396
25.0%
o 132
 
8.3%
d 132
 
8.3%
i 132
 
8.3%
n 132
 
8.3%
t 132
 
8.3%
e 132
 
8.3%
l 132
 
8.3%
r 66
 
4.2%
u 66
 
4.2%
Other values (2) 132
 
8.3%
Space Separator
ValueCountFrequency (%)
132
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1650
92.6%
Common 132
 
7.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 396
24.0%
o 132
 
8.0%
d 132
 
8.0%
i 132
 
8.0%
n 132
 
8.0%
t 132
 
8.0%
e 132
 
8.0%
l 132
 
8.0%
C 66
 
4.0%
r 66
 
4.0%
Other values (3) 198
12.0%
Common
ValueCountFrequency (%)
132
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1782
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 396
22.2%
o 132
 
7.4%
d 132
 
7.4%
i 132
 
7.4%
n 132
 
7.4%
t 132
 
7.4%
e 132
 
7.4%
132
 
7.4%
l 132
 
7.4%
C 66
 
3.7%
Other values (4) 264
14.8%

dynamicProperties
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:57.952694image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1073
Median length877
Mean length64.79444503
Min length19

Characters and Unicode

Total characters1222412
Distinct characters66
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st row{ "irn": "2495311" }
2nd row{ "irn": "2489043", "media": "1223142:2398869c-63eb-410d-8cf8-205d5aacbfcd", "mm_repository_id": "1223142" }
3rd row{ "irn": "2489051", "media": "1223150:ed40315a-fb57-4421-a251-a7ede5b38478", "mm_repository_id": "1223150" }
4th row{ "irn": "2489049", "media": "1223148:3d1eee9f-f1e6-4948-b842-640fbf489e2a", "mm_repository_id": "1223148" }
5th row{ "irn": "2489042", "media": "1223141:56aefa44-5e83-4aec-83f3-b632bc2756cf", "mm_repository_id": "1223141" }
ValueCountFrequency (%)
38111
29.9%
irn 18866
14.8%
solr_long_lat 13323
 
10.5%
original_num 6214
 
4.9%
osteo 4381
 
3.4%
mm_repository_id 455
 
0.4%
media 455
 
0.4%
related_record_links 379
 
0.3%
related_record_types 379
 
0.3%
71.273830,44.049466 311
 
0.2%
Other values (33627) 44501
34.9%
2025-01-08T18:32:58.163256image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 160284
 
13.1%
108509
 
8.9%
1 48769
 
4.0%
l 47322
 
3.9%
n 44996
 
3.7%
0 44312
 
3.6%
4 43624
 
3.6%
r 41589
 
3.4%
: 41225
 
3.4%
3 41094
 
3.4%
Other values (56) 600688
49.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 390133
31.9%
Lowercase Letter 325132
26.6%
Other Punctuation 272361
22.3%
Space Separator 108509
 
8.9%
Connector Punctuation 35286
 
2.9%
Uppercase Letter 26260
 
2.1%
Open Punctuation 23249
 
1.9%
Close Punctuation 23247
 
1.9%
Dash Punctuation 18214
 
1.5%
Math Symbol 19
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
M 10867
41.4%
O 8766
33.4%
A 5225
19.9%
P 820
 
3.1%
Y 404
 
1.5%
R 105
 
0.4%
H 10
 
< 0.1%
S 8
 
< 0.1%
C 8
 
< 0.1%
E 7
 
< 0.1%
Other values (11) 40
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
l 47322
14.6%
n 44996
13.8%
r 41589
12.8%
o 38912
12.0%
i 33038
10.2%
a 23318
7.2%
g 19538
6.0%
t 19299
5.9%
s 18921
 
5.8%
e 10092
 
3.1%
Other values (10) 28107
8.6%
Decimal Number
ValueCountFrequency (%)
1 48769
12.5%
0 44312
11.4%
4 43624
11.2%
3 41094
10.5%
5 40074
10.3%
7 40008
10.3%
6 38040
9.8%
2 36853
9.4%
9 29256
7.5%
8 28103
7.2%
Other Punctuation
ValueCountFrequency (%)
" 160284
58.8%
: 41225
 
15.1%
. 36322
 
13.3%
, 34528
 
12.7%
/ 1
 
< 0.1%
? 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
{ 18866
81.1%
( 4383
 
18.9%
Close Punctuation
ValueCountFrequency (%)
} 18866
81.2%
) 4381
 
18.8%
Space Separator
ValueCountFrequency (%)
108509
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 35286
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18214
100.0%
Math Symbol
ValueCountFrequency (%)
| 19
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 871020
71.3%
Latin 351392
28.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 47322
13.5%
n 44996
12.8%
r 41589
11.8%
o 38912
11.1%
i 33038
9.4%
a 23318
6.6%
g 19538
 
5.6%
t 19299
 
5.5%
s 18921
 
5.4%
M 10867
 
3.1%
Other values (31) 53592
15.3%
Common
ValueCountFrequency (%)
" 160284
18.4%
108509
12.5%
1 48769
 
5.6%
0 44312
 
5.1%
4 43624
 
5.0%
: 41225
 
4.7%
3 41094
 
4.7%
5 40074
 
4.6%
7 40008
 
4.6%
6 38040
 
4.4%
Other values (15) 265081
30.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1222412
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 160284
 
13.1%
108509
 
8.9%
1 48769
 
4.0%
l 47322
 
3.9%
n 44996
 
3.7%
0 44312
 
3.6%
4 43624
 
3.6%
r 41589
 
3.4%
: 41225
 
3.4%
3 41094
 
3.4%
Other values (56) 600688
49.1%

occurrenceID
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:58.277172image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length45
Median length45
Mean length45
Min length45

Characters and Unicode

Total characters848970
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowurn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52
2nd rowurn:uuid:2df9a10d-0595-4c2d-bb13-43b6677a15ce
3rd rowurn:uuid:35474ea7-f956-4872-88c2-a8c56cbe9f90
4th rowurn:uuid:6eaa6b8b-f8a1-44ee-b671-1a734de9ada2
5th rowurn:uuid:b45e450f-3835-46af-be66-6494f44d014e
ValueCountFrequency (%)
urn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52 1
 
< 0.1%
urn:uuid:7a7bd1dd-0c61-423e-8d79-316ae9466af3 1
 
< 0.1%
urn:uuid:c2221631-94d5-4364-b7a1-6e8875d768ba 1
 
< 0.1%
urn:uuid:565e73ca-2d43-4f72-bf13-66ca168617ad 1
 
< 0.1%
urn:uuid:8ebc41fa-c154-4c27-a7d4-606e62b2dc95 1
 
< 0.1%
urn:uuid:9ba9abd0-a03f-49c3-97e5-d8a6557c42bd 1
 
< 0.1%
urn:uuid:183dfe30-8155-4c5d-ae5d-15cc0b7ea3b8 1
 
< 0.1%
urn:uuid:fa9cc82d-fccf-4fb9-834c-a5e890e5ff61 1
 
< 0.1%
urn:uuid:b4b795a6-619a-4d62-b9a2-3c911f103ed3 1
 
< 0.1%
urn:uuid:a57c6e6c-6f11-465a-a440-a5953d2cf9d2 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-08T18:32:58.438102image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 75464
 
8.9%
u 56598
 
6.7%
4 54374
 
6.4%
d 54159
 
6.4%
8 40303
 
4.7%
9 40140
 
4.7%
b 40080
 
4.7%
a 39808
 
4.7%
: 37732
 
4.4%
f 35654
 
4.2%
Other values (12) 374658
44.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 382516
45.1%
Lowercase Letter 353258
41.6%
Dash Punctuation 75464
 
8.9%
Other Punctuation 37732
 
4.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 56598
16.0%
d 54159
15.3%
b 40080
11.3%
a 39808
11.3%
f 35654
10.1%
e 35275
10.0%
c 35086
9.9%
r 18866
 
5.3%
i 18866
 
5.3%
n 18866
 
5.3%
Decimal Number
ValueCountFrequency (%)
4 54374
14.2%
8 40303
10.5%
9 40140
10.5%
1 35621
9.3%
5 35502
9.3%
2 35421
9.3%
7 35401
9.3%
0 35374
9.2%
6 35239
9.2%
3 35141
9.2%
Dash Punctuation
ValueCountFrequency (%)
- 75464
100.0%
Other Punctuation
ValueCountFrequency (%)
: 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 495712
58.4%
Latin 353258
41.6%

Most frequent character per script

Common
ValueCountFrequency (%)
- 75464
15.2%
4 54374
11.0%
8 40303
8.1%
9 40140
8.1%
: 37732
7.6%
1 35621
7.2%
5 35502
7.2%
2 35421
7.1%
7 35401
7.1%
0 35374
7.1%
Other values (2) 70380
14.2%
Latin
ValueCountFrequency (%)
u 56598
16.0%
d 54159
15.3%
b 40080
11.3%
a 39808
11.3%
f 35654
10.1%
e 35275
10.0%
c 35086
9.9%
r 18866
 
5.3%
i 18866
 
5.3%
n 18866
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 848970
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 75464
 
8.9%
u 56598
 
6.7%
4 54374
 
6.4%
d 54159
 
6.4%
8 40303
 
4.7%
9 40140
 
4.7%
b 40080
 
4.7%
a 39808
 
4.7%
: 37732
 
4.4%
f 35654
 
4.2%
Other values (12) 374658
44.1%

catalogNumber
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:58.642106image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length18
Median length14
Mean length14.95473338
Min length14

Characters and Unicode

Total characters282136
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowYPM MAM 017903
2nd rowYPM MAM 017889
3rd rowYPM MAM 017897
4th rowYPM MAM 017895
5th rowYPM MAM 017888
ValueCountFrequency (%)
ypm 18866
33.3%
mam 18866
33.3%
015555.002 1
 
< 0.1%
017813 1
 
< 0.1%
017899 1
 
< 0.1%
017902 1
 
< 0.1%
017890 1
 
< 0.1%
017901 1
 
< 0.1%
017896 1
 
< 0.1%
017898 1
 
< 0.1%
Other values (18858) 18858
33.3%
2025-01-08T18:32:58.905327image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
M 56598
20.1%
0 44332
15.7%
37732
13.4%
1 20061
 
7.1%
Y 18866
 
6.7%
P 18866
 
6.7%
A 18866
 
6.7%
2 9843
 
3.5%
6 8199
 
2.9%
7 8066
 
2.9%
Other values (6) 40707
14.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 126705
44.9%
Uppercase Letter 113196
40.1%
Space Separator 37732
 
13.4%
Other Punctuation 4503
 
1.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
M 56598
50.0%
Y 18866
 
16.7%
P 18866
 
16.7%
A 18866
 
16.7%
Space Separator
ValueCountFrequency (%)
37732
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4503
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 168940
59.9%
Latin 113196
40.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 44332
26.2%
37732
22.3%
1 20061
11.9%
2 9843
 
5.8%
6 8199
 
4.9%
7 8066
 
4.8%
5 8017
 
4.7%
4 7780
 
4.6%
3 7635
 
4.5%
9 6550
 
3.9%
Other values (2) 10725
 
6.3%
Latin
ValueCountFrequency (%)
M 56598
50.0%
Y 18866
 
16.7%
P 18866
 
16.7%
A 18866
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 282136
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
M 56598
20.1%
0 44332
15.7%
37732
13.4%
1 20061
 
7.1%
Y 18866
 
6.7%
P 18866
 
6.7%
A 18866
 
6.7%
2 9843
 
3.5%
6 8199
 
2.9%
7 8066
 
2.9%
Other values (6) 40707
14.4%

recordedBy
Text

Missing 

Distinct1050
Distinct (%)7.2%
Missing4296
Missing (%)22.8%
Memory size147.5 KiB
2025-01-08T18:32:59.090455image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length120
Median length80
Mean length16.20549073
Min length3

Characters and Unicode

Total characters236114
Distinct characters69
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique526 ?
Unique (%)3.6%

Sample

1st rowRichard E. Boardman, Kristof Zyskowski
2nd rowRichard E. Boardman
3rd rowLourdes M. Rojas
4th rowRichard E. Boardman
5th rowRichard E. Boardman
ValueCountFrequency (%)
mariko 1875
 
4.7%
yamasaki 1875
 
4.7%
e 1394
 
3.5%
b 1115
 
2.8%
c 1091
 
2.7%
j 1070
 
2.7%
a 867
 
2.2%
ryan 849
 
2.1%
stephens 848
 
2.1%
d 830
 
2.1%
Other values (1289) 28092
70.4%
2025-01-08T18:32:59.345588image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25336
 
10.7%
a 21256
 
9.0%
e 16815
 
7.1%
r 13987
 
5.9%
i 13506
 
5.7%
o 12209
 
5.2%
n 11603
 
4.9%
. 10574
 
4.5%
l 9856
 
4.2%
s 8143
 
3.4%
Other values (59) 92829
39.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 157721
66.8%
Uppercase Letter 40869
 
17.3%
Space Separator 25336
 
10.7%
Other Punctuation 11218
 
4.8%
Decimal Number 552
 
0.2%
Dash Punctuation 416
 
0.2%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 21256
13.5%
e 16815
10.7%
r 13987
 
8.9%
i 13506
 
8.6%
o 12209
 
7.7%
n 11603
 
7.4%
l 9856
 
6.2%
s 8143
 
5.2%
t 6641
 
4.2%
m 6447
 
4.1%
Other values (17) 37258
23.6%
Uppercase Letter
ValueCountFrequency (%)
M 4521
 
11.1%
R 4515
 
11.0%
C 3379
 
8.3%
S 2982
 
7.3%
J 2726
 
6.7%
E 2666
 
6.5%
B 2557
 
6.3%
D 2059
 
5.0%
G 1997
 
4.9%
Y 1954
 
4.8%
Other values (15) 11513
28.2%
Decimal Number
ValueCountFrequency (%)
1 190
34.4%
7 70
 
12.7%
8 69
 
12.5%
9 68
 
12.3%
6 64
 
11.6%
2 43
 
7.8%
0 41
 
7.4%
3 7
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 10574
94.3%
, 565
 
5.0%
& 44
 
0.4%
' 32
 
0.3%
/ 3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
25336
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 416
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 198590
84.1%
Common 37524
 
15.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 21256
 
10.7%
e 16815
 
8.5%
r 13987
 
7.0%
i 13506
 
6.8%
o 12209
 
6.1%
n 11603
 
5.8%
l 9856
 
5.0%
s 8143
 
4.1%
t 6641
 
3.3%
m 6447
 
3.2%
Other values (42) 78127
39.3%
Common
ValueCountFrequency (%)
25336
67.5%
. 10574
28.2%
, 565
 
1.5%
- 416
 
1.1%
1 190
 
0.5%
7 70
 
0.2%
8 69
 
0.2%
9 68
 
0.2%
6 64
 
0.2%
& 44
 
0.1%
Other values (7) 128
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 236040
> 99.9%
None 74
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25336
 
10.7%
a 21256
 
9.0%
e 16815
 
7.1%
r 13987
 
5.9%
i 13506
 
5.7%
o 12209
 
5.2%
n 11603
 
4.9%
. 10574
 
4.5%
l 9856
 
4.2%
s 8143
 
3.4%
Other values (58) 92755
39.3%
None
ValueCountFrequency (%)
ü 74
100.0%
Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:32:59.403043image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length1
Mean length1.000318032
Min length1

Characters and Unicode

Total characters18872
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
1 18844
99.9%
2 5
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
11 2
 
< 0.1%
17 1
 
< 0.1%
10 1
 
< 0.1%
4 1
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%
Other values (3) 3
 
< 0.1%
2025-01-08T18:32:59.507270image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18872
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 18872
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18872
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

sex
Text

Missing 

Distinct2
Distinct (%)< 0.1%
Missing10133
Missing (%)53.7%
Memory size147.5 KiB
2025-01-08T18:32:59.549001image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.911714188
Min length4

Characters and Unicode

Total characters42894
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFEMALE
2nd rowFEMALE
3rd rowMALE
4th rowMALE
5th rowFEMALE
ValueCountFrequency (%)
male 4752
54.4%
female 3981
45.6%
2025-01-08T18:32:59.648098image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 12714
29.6%
M 8733
20.4%
A 8733
20.4%
L 8733
20.4%
F 3981
 
9.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 42894
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 12714
29.6%
M 8733
20.4%
A 8733
20.4%
L 8733
20.4%
F 3981
 
9.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 42894
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 12714
29.6%
M 8733
20.4%
A 8733
20.4%
L 8733
20.4%
F 3981
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42894
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 12714
29.6%
M 8733
20.4%
A 8733
20.4%
L 8733
20.4%
F 3981
 
9.3%

lifeStage
Text

Missing 

Distinct6
Distinct (%)0.7%
Missing17963
Missing (%)95.2%
Memory size147.5 KiB
2025-01-08T18:32:59.692785image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length5
Mean length6.280177187
Min length5

Characters and Unicode

Total characters5671
Distinct characters20
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAdult
2nd rowAdult
3rd rowAdult
4th rowAdult
5th rowAdult
ValueCountFrequency (%)
adult 508
56.3%
juvenile 322
35.7%
immature 29
 
3.2%
neonate 25
 
2.8%
subadult 17
 
1.9%
embryo 2
 
0.2%
2025-01-08T18:32:59.798663image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
u 893
15.7%
l 847
14.9%
e 723
12.7%
t 579
10.2%
d 525
9.3%
A 508
9.0%
n 347
 
6.1%
J 322
 
5.7%
v 322
 
5.7%
i 322
 
5.7%
Other values (10) 283
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4768
84.1%
Uppercase Letter 903
 
15.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 893
18.7%
l 847
17.8%
e 723
15.2%
t 579
12.1%
d 525
11.0%
n 347
 
7.3%
v 322
 
6.8%
i 322
 
6.8%
a 71
 
1.5%
m 60
 
1.3%
Other values (4) 79
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
A 508
56.3%
J 322
35.7%
I 29
 
3.2%
N 25
 
2.8%
S 17
 
1.9%
E 2
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 5671
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
u 893
15.7%
l 847
14.9%
e 723
12.7%
t 579
10.2%
d 525
9.3%
A 508
9.0%
n 347
 
6.1%
J 322
 
5.7%
v 322
 
5.7%
i 322
 
5.7%
Other values (10) 283
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5671
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
u 893
15.7%
l 847
14.9%
e 723
12.7%
t 579
10.2%
d 525
9.3%
A 508
9.0%
n 347
 
6.1%
J 322
 
5.7%
v 322
 
5.7%
i 322
 
5.7%
Other values (10) 283
 
5.0%

reproductiveCondition
Text

Missing 

Distinct626
Distinct (%)27.3%
Missing16576
Missing (%)87.9%
Memory size147.5 KiB
2025-01-08T18:32:59.939368image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length166
Median length116
Mean length12.40349345
Min length2

Characters and Unicode

Total characters28404
Distinct characters70
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique457 ?
Unique (%)20.0%

Sample

1st rowtestes 5 x 2 mm
2nd rowEMB; 6; 10x8
3rd rowSCR; L=6x4
4th rowSCR R=8x5
5th rowEMB; L=4; R=2, 14X18
ValueCountFrequency (%)
testes 1006
16.2%
mm 877
 
14.1%
embryo 650
 
10.4%
no 643
 
10.3%
3 151
 
2.4%
2 137
 
2.2%
embryos 137
 
2.2%
lactating 137
 
2.2%
4 135
 
2.2%
5 111
 
1.8%
Other values (469) 2242
36.0%
2025-01-08T18:33:00.163864image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3936
13.9%
e 3252
11.4%
m 2969
 
10.5%
s 2466
 
8.7%
t 2401
 
8.5%
o 1669
 
5.9%
r 1048
 
3.7%
n 966
 
3.4%
b 824
 
2.9%
y 821
 
2.9%
Other values (60) 8052
28.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 18999
66.9%
Space Separator 3936
 
13.9%
Decimal Number 2531
 
8.9%
Uppercase Letter 1676
 
5.9%
Other Punctuation 737
 
2.6%
Math Symbol 248
 
0.9%
Dash Punctuation 229
 
0.8%
Open Punctuation 24
 
0.1%
Close Punctuation 24
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 3252
17.1%
m 2969
15.6%
s 2466
13.0%
t 2401
12.6%
o 1669
8.8%
r 1048
 
5.5%
n 966
 
5.1%
b 824
 
4.3%
y 821
 
4.3%
a 601
 
3.2%
Other values (15) 1982
10.4%
Uppercase Letter
ValueCountFrequency (%)
R 354
21.1%
T 280
16.7%
L 194
11.6%
S 157
9.4%
C 141
 
8.4%
P 128
 
7.6%
N 125
 
7.5%
A 86
 
5.1%
E 70
 
4.2%
B 49
 
2.9%
Other values (8) 92
 
5.5%
Decimal Number
ValueCountFrequency (%)
5 466
18.4%
1 447
17.7%
2 372
14.7%
3 348
13.7%
4 251
9.9%
0 199
7.9%
6 162
 
6.4%
7 104
 
4.1%
8 104
 
4.1%
9 78
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 348
47.2%
, 218
29.6%
; 113
 
15.3%
: 39
 
5.3%
" 7
 
0.9%
& 5
 
0.7%
? 3
 
0.4%
/ 3
 
0.4%
' 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 236
95.2%
+ 10
 
4.0%
~ 1
 
0.4%
> 1
 
0.4%
Space Separator
ValueCountFrequency (%)
3936
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 229
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 20675
72.8%
Common 7729
 
27.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 3252
15.7%
m 2969
14.4%
s 2466
11.9%
t 2401
11.6%
o 1669
8.1%
r 1048
 
5.1%
n 966
 
4.7%
b 824
 
4.0%
y 821
 
4.0%
a 601
 
2.9%
Other values (33) 3658
17.7%
Common
ValueCountFrequency (%)
3936
50.9%
5 466
 
6.0%
1 447
 
5.8%
2 372
 
4.8%
3 348
 
4.5%
. 348
 
4.5%
4 251
 
3.2%
= 236
 
3.1%
- 229
 
3.0%
, 218
 
2.8%
Other values (17) 878
 
11.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3936
13.9%
e 3252
11.4%
m 2969
 
10.5%
s 2466
 
8.7%
t 2401
 
8.5%
o 1669
 
5.9%
r 1048
 
3.7%
n 966
 
3.4%
b 824
 
2.9%
y 821
 
2.9%
Other values (60) 8052
28.3%

behavior
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing18864
Missing (%)> 99.9%
Memory size147.5 KiB
2025-01-08T18:33:00.232404image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length64
Median length56.5
Mean length56.5
Min length49

Characters and Unicode

Total characters113
Distinct characters27
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowwas calling while hanging from a 0.5 m tall shrub
2nd rowwas day-roosting in a dense subcanopy tree ca. 15 m above ground
ValueCountFrequency (%)
was 2
 
9.1%
a 2
 
9.1%
m 2
 
9.1%
in 1
 
4.5%
above 1
 
4.5%
15 1
 
4.5%
ca 1
 
4.5%
tree 1
 
4.5%
subcanopy 1
 
4.5%
dense 1
 
4.5%
Other values (9) 9
40.9%
2025-01-08T18:33:00.357347image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
17.7%
a 11
 
9.7%
n 8
 
7.1%
o 6
 
5.3%
s 6
 
5.3%
e 6
 
5.3%
l 5
 
4.4%
i 5
 
4.4%
g 5
 
4.4%
r 5
 
4.4%
Other values (17) 36
31.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 86
76.1%
Space Separator 20
 
17.7%
Decimal Number 4
 
3.5%
Other Punctuation 2
 
1.8%
Dash Punctuation 1
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 11
12.8%
n 8
 
9.3%
o 6
 
7.0%
s 6
 
7.0%
e 6
 
7.0%
l 5
 
5.8%
i 5
 
5.8%
g 5
 
5.8%
r 5
 
5.8%
t 3
 
3.5%
Other values (11) 26
30.2%
Decimal Number
ValueCountFrequency (%)
5 2
50.0%
0 1
25.0%
1 1
25.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 86
76.1%
Common 27
 
23.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 11
12.8%
n 8
 
9.3%
o 6
 
7.0%
s 6
 
7.0%
e 6
 
7.0%
l 5
 
5.8%
i 5
 
5.8%
g 5
 
5.8%
r 5
 
5.8%
t 3
 
3.5%
Other values (11) 26
30.2%
Common
ValueCountFrequency (%)
20
74.1%
. 2
 
7.4%
5 2
 
7.4%
0 1
 
3.7%
- 1
 
3.7%
1 1
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 113
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20
17.7%
a 11
 
9.7%
n 8
 
7.1%
o 6
 
5.3%
s 6
 
5.3%
e 6
 
5.3%
l 5
 
4.4%
i 5
 
4.4%
g 5
 
4.4%
r 5
 
4.4%
Other values (17) 36
31.9%

occurrenceStatus
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:00.402345image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters132062
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRESENT
2nd rowPRESENT
3rd rowPRESENT
4th rowPRESENT
5th rowPRESENT
ValueCountFrequency (%)
present 18866
100.0%
2025-01-08T18:33:00.498307image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 37732
28.6%
P 18866
14.3%
R 18866
14.3%
S 18866
14.3%
N 18866
14.3%
T 18866
14.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 132062
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 37732
28.6%
P 18866
14.3%
R 18866
14.3%
S 18866
14.3%
N 18866
14.3%
T 18866
14.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 132062
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 37732
28.6%
P 18866
14.3%
R 18866
14.3%
S 18866
14.3%
N 18866
14.3%
T 18866
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132062
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 37732
28.6%
P 18866
14.3%
R 18866
14.3%
S 18866
14.3%
N 18866
14.3%
T 18866
14.3%

preparations
Text

Missing 

Distinct1019
Distinct (%)5.5%
Missing349
Missing (%)1.8%
Memory size147.5 KiB
2025-01-08T18:33:00.676861image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length262
Median length190
Mean length25.19781822
Min length4

Characters and Unicode

Total characters466588
Distinct characters80
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique762 ?
Unique (%)4.1%

Sample

1st rowskin, round; skull; tissue (frozen)
2nd rowtissue (frozen)
3rd rowtissue (frozen)
4th rowtissue (frozen)
5th rowtissue (frozen)
ValueCountFrequency (%)
skeleton 13111
20.4%
skull 8315
12.9%
only 7454
11.6%
skin 6927
10.8%
round 5887
9.2%
tissue 4575
 
7.1%
frozen 4435
 
6.9%
incomplete 1443
 
2.2%
alc 1212
 
1.9%
10 1172
 
1.8%
Other values (1014) 9705
15.1%
2025-01-08T18:33:00.919269image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45719
 
9.8%
n 43643
 
9.4%
e 42409
 
9.1%
l 42371
 
9.1%
s 40001
 
8.6%
o 35814
 
7.7%
k 28618
 
6.1%
t 22389
 
4.8%
u 19801
 
4.2%
i 16215
 
3.5%
Other values (70) 129608
27.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 352064
75.5%
Space Separator 45719
 
9.8%
Other Punctuation 21809
 
4.7%
Close Punctuation 15985
 
3.4%
Open Punctuation 15984
 
3.4%
Decimal Number 7890
 
1.7%
Uppercase Letter 4747
 
1.0%
Dash Punctuation 1217
 
0.3%
Math Symbol 1173
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 43643
12.4%
e 42409
12.0%
l 42371
12.0%
s 40001
11.4%
o 35814
10.2%
k 28618
8.1%
t 22389
6.4%
u 19801
5.6%
i 16215
 
4.6%
r 13228
 
3.8%
Other values (16) 47575
13.5%
Uppercase Letter
ValueCountFrequency (%)
C 1139
24.0%
S 633
13.3%
L 485
10.2%
O 365
 
7.7%
M 258
 
5.4%
R 249
 
5.2%
I 230
 
4.8%
T 229
 
4.8%
E 213
 
4.5%
D 125
 
2.6%
Other values (16) 821
17.3%
Other Punctuation
ValueCountFrequency (%)
; 7821
35.9%
, 7453
34.2%
. 3527
16.2%
% 2384
 
10.9%
/ 316
 
1.4%
" 272
 
1.2%
& 18
 
0.1%
' 10
 
< 0.1%
? 6
 
< 0.1%
: 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 2841
36.0%
1 1930
24.5%
7 1409
17.9%
3 690
 
8.7%
2 329
 
4.2%
5 229
 
2.9%
4 186
 
2.4%
6 112
 
1.4%
8 99
 
1.3%
9 65
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 15984
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 15983
> 99.9%
[ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
> 1172
99.9%
+ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
45719
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1217
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 356811
76.5%
Common 109777
 
23.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 43643
12.2%
e 42409
11.9%
l 42371
11.9%
s 40001
11.2%
o 35814
10.0%
k 28618
8.0%
t 22389
6.3%
u 19801
 
5.5%
i 16215
 
4.5%
r 13228
 
3.7%
Other values (42) 52322
14.7%
Common
ValueCountFrequency (%)
45719
41.6%
) 15984
 
14.6%
( 15983
 
14.6%
; 7821
 
7.1%
, 7453
 
6.8%
. 3527
 
3.2%
0 2841
 
2.6%
% 2384
 
2.2%
1 1930
 
1.8%
7 1409
 
1.3%
Other values (18) 4726
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 466588
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45719
 
9.8%
n 43643
 
9.4%
e 42409
 
9.1%
l 42371
 
9.1%
s 40001
 
8.6%
o 35814
 
7.7%
k 28618
 
6.1%
t 22389
 
4.8%
u 19801
 
4.2%
i 16215
 
3.5%
Other values (70) 129608
27.8%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:00.976658image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length12.98484045
Min length7

Characters and Unicode

Total characters244972
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowin collection
2nd rowin collection
3rd rowin collection
4th rowin collection
5th rowin collection
ValueCountFrequency (%)
in 18804
49.8%
collection 18804
49.8%
on 62
 
0.2%
loan 38
 
0.1%
not 14
 
< 0.1%
view 14
 
< 0.1%
exhibit 10
 
< 0.1%
2025-01-08T18:33:01.081365image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 37722
15.4%
o 37722
15.4%
l 37646
15.4%
i 37642
15.4%
c 37608
15.4%
18880
7.7%
e 18828
7.7%
t 18828
7.7%
a 38
 
< 0.1%
v 14
 
< 0.1%
Other values (4) 44
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 226092
92.3%
Space Separator 18880
 
7.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 37722
16.7%
o 37722
16.7%
l 37646
16.7%
i 37642
16.6%
c 37608
16.6%
e 18828
8.3%
t 18828
8.3%
a 38
 
< 0.1%
v 14
 
< 0.1%
w 14
 
< 0.1%
Other values (3) 30
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18880
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 226092
92.3%
Common 18880
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 37722
16.7%
o 37722
16.7%
l 37646
16.7%
i 37642
16.6%
c 37608
16.6%
e 18828
8.3%
t 18828
8.3%
a 38
 
< 0.1%
v 14
 
< 0.1%
w 14
 
< 0.1%
Other values (3) 30
 
< 0.1%
Common
ValueCountFrequency (%)
18880
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 244972
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 37722
15.4%
o 37722
15.4%
l 37646
15.4%
i 37642
15.4%
c 37608
15.4%
18880
7.7%
e 18828
7.7%
t 18828
7.7%
a 38
 
< 0.1%
v 14
 
< 0.1%
Other values (4) 44
 
< 0.1%

associatedReferences
Text

Missing 

Distinct178
Distinct (%)2.8%
Missing12450
Missing (%)66.0%
Memory size147.5 KiB
2025-01-08T18:33:01.194730image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length116
Median length1
Mean length8.085099751
Min length1

Characters and Unicode

Total characters51874
Distinct characters65
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)1.2%

Sample

1st row|
2nd row|
3rd row|
4th row|
5th row|
ValueCountFrequency (%)
4933
37.6%
by 1565
 
11.9%
det 1461
 
11.1%
kristof 303
 
2.3%
jordan 300
 
2.3%
g 300
 
2.3%
colosi 300
 
2.3%
a 296
 
2.3%
zyskowski 291
 
2.2%
mary 288
 
2.2%
Other values (171) 3078
23.5%
2025-01-08T18:33:01.381078image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
| 7591
 
14.6%
6699
 
12.9%
e 2799
 
5.4%
. 2603
 
5.0%
r 2373
 
4.6%
y 2262
 
4.4%
t 2241
 
4.3%
o 2076
 
4.0%
b 1743
 
3.4%
D 1738
 
3.4%
Other values (55) 19749
38.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 23551
45.4%
Math Symbol 7591
 
14.6%
Space Separator 6699
 
12.9%
Uppercase Letter 5999
 
11.6%
Other Punctuation 4176
 
8.1%
Decimal Number 3818
 
7.4%
Dash Punctuation 40
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 2799
11.9%
r 2373
10.1%
y 2262
9.6%
t 2241
9.5%
o 2076
8.8%
b 1743
7.4%
s 1614
 
6.9%
i 1410
 
6.0%
a 1357
 
5.8%
n 1339
 
5.7%
Other values (15) 4337
18.4%
Uppercase Letter
ValueCountFrequency (%)
D 1738
29.0%
A 601
 
10.0%
K 503
 
8.4%
C 448
 
7.5%
J 435
 
7.3%
M 426
 
7.1%
G 353
 
5.9%
T 326
 
5.4%
Z 303
 
5.1%
N 129
 
2.2%
Other values (13) 737
12.3%
Decimal Number
ValueCountFrequency (%)
0 1658
43.4%
2 1138
29.8%
8 277
 
7.3%
9 275
 
7.2%
1 251
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 2603
62.3%
: 1569
37.6%
; 3
 
0.1%
, 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
| 7591
100.0%
Space Separator
ValueCountFrequency (%)
6699
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 29550
57.0%
Common 22324
43.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 2799
 
9.5%
r 2373
 
8.0%
y 2262
 
7.7%
t 2241
 
7.6%
o 2076
 
7.0%
b 1743
 
5.9%
D 1738
 
5.9%
s 1614
 
5.5%
i 1410
 
4.8%
a 1357
 
4.6%
Other values (38) 9937
33.6%
Common
ValueCountFrequency (%)
| 7591
34.0%
6699
30.0%
. 2603
 
11.7%
0 1658
 
7.4%
: 1569
 
7.0%
2 1138
 
5.1%
8 277
 
1.2%
9 275
 
1.2%
1 251
 
1.1%
7 132
 
0.6%
Other values (7) 131
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 51873
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
| 7591
 
14.6%
6699
 
12.9%
e 2799
 
5.4%
. 2603
 
5.0%
r 2373
 
4.6%
y 2262
 
4.4%
t 2241
 
4.3%
o 2076
 
4.0%
b 1743
 
3.4%
D 1738
 
3.4%
Other values (54) 19748
38.1%
None
ValueCountFrequency (%)
é 1
100.0%

associatedTaxa
Text

Missing 

Distinct373
Distinct (%)98.4%
Missing18487
Missing (%)98.0%
Memory size147.5 KiB
2025-01-08T18:33:01.570025image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length131
Median length10
Mean length13.77572559
Min length10

Characters and Unicode

Total characters5221
Distinct characters44
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique371 ?
Unique (%)97.9%

Sample

1st rowENT.013766
2nd rowoffspring: MAM.015755
3rd rowparent: MAM.015754
4th rowMAM.001438
5th rowMAM.004953
ValueCountFrequency (%)
part 39
 
6.9%
same 36
 
6.3%
specimen 36
 
6.3%
of 36
 
6.3%
other 8
 
1.4%
parent 7
 
1.2%
mam.012670 6
 
1.1%
skeleton 3
 
0.5%
mam.013246|part 3
 
0.5%
mam.013247|part 3
 
0.5%
Other values (381) 392
68.9%
2025-01-08T18:33:01.911849image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 884
16.9%
M 775
14.8%
. 402
 
7.7%
A 391
 
7.5%
1 344
 
6.6%
3 195
 
3.7%
190
 
3.6%
9 173
 
3.3%
2 166
 
3.2%
5 148
 
2.8%
Other values (34) 1553
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2394
45.9%
Uppercase Letter 1206
23.1%
Lowercase Letter 902
 
17.3%
Other Punctuation 510
 
9.8%
Space Separator 190
 
3.6%
Math Symbol 19
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 130
14.4%
p 100
11.1%
a 98
10.9%
s 85
9.4%
r 72
8.0%
m 72
8.0%
t 70
7.8%
n 59
6.5%
o 54
6.0%
f 50
 
5.5%
Other values (7) 112
12.4%
Uppercase Letter
ValueCountFrequency (%)
M 775
64.3%
A 391
32.4%
H 7
 
0.6%
E 6
 
0.5%
X 5
 
0.4%
Y 5
 
0.4%
P 5
 
0.4%
R 5
 
0.4%
T 3
 
0.2%
S 2
 
0.2%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 884
36.9%
1 344
 
14.4%
3 195
 
8.1%
9 173
 
7.2%
2 166
 
6.9%
5 148
 
6.2%
4 133
 
5.6%
6 124
 
5.2%
8 114
 
4.8%
7 113
 
4.7%
Other Punctuation
ValueCountFrequency (%)
. 402
78.8%
: 75
 
14.7%
? 33
 
6.5%
Space Separator
ValueCountFrequency (%)
190
100.0%
Math Symbol
ValueCountFrequency (%)
| 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3113
59.6%
Latin 2108
40.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 775
36.8%
A 391
18.5%
e 130
 
6.2%
p 100
 
4.7%
a 98
 
4.6%
s 85
 
4.0%
r 72
 
3.4%
m 72
 
3.4%
t 70
 
3.3%
n 59
 
2.8%
Other values (19) 256
 
12.1%
Common
ValueCountFrequency (%)
0 884
28.4%
. 402
12.9%
1 344
 
11.1%
3 195
 
6.3%
190
 
6.1%
9 173
 
5.6%
2 166
 
5.3%
5 148
 
4.8%
4 133
 
4.3%
6 124
 
4.0%
Other values (5) 354
11.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5221
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 884
16.9%
M 775
14.8%
. 402
 
7.7%
A 391
 
7.5%
1 344
 
6.6%
3 195
 
3.7%
190
 
3.6%
9 173
 
3.3%
2 166
 
3.2%
5 148
 
2.8%
Other values (34) 1553
29.7%

otherCatalogNumbers
Text

Missing 

Distinct6197
Distinct (%)99.7%
Missing12652
Missing (%)67.1%
Memory size147.5 KiB
2025-01-08T18:33:02.105289image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length224
Median length124
Mean length20.11812037
Min length3

Characters and Unicode

Total characters125014
Distinct characters55
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6180 ?
Unique (%)99.5%

Sample

1st rowOsteo 12753 (MAM.O.12753)
2nd rowOsteo 2583 (MAM.O.02583)
3rd rowOsteo 3875 (MAM.O.03875)
4th rowVP.061504
5th rowUAM 112553
ValueCountFrequency (%)
osteo 4413
28.6%
m 6
 
< 0.1%
dcm 5
 
< 0.1%
uam 5
 
< 0.1%
14305 2
 
< 0.1%
13629 2
 
< 0.1%
9529 2
 
< 0.1%
13506 2
 
< 0.1%
54886 2
 
< 0.1%
13739 2
 
< 0.1%
Other values (10870) 11001
71.2%
2025-01-08T18:33:02.365858image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 10298
 
8.2%
M 10211
 
8.2%
9228
 
7.4%
O 9191
 
7.4%
1 8952
 
7.2%
0 8355
 
6.7%
3 6022
 
4.8%
4 5451
 
4.4%
2 5128
 
4.1%
A 5092
 
4.1%
Other values (45) 47086
37.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 51869
41.5%
Uppercase Letter 25108
20.1%
Lowercase Letter 18707
 
15.0%
Other Punctuation 10738
 
8.6%
Space Separator 9228
 
7.4%
Open Punctuation 4595
 
3.7%
Close Punctuation 4593
 
3.7%
Dash Punctuation 176
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
M 10211
40.7%
O 9191
36.6%
A 5092
20.3%
P 463
 
1.8%
R 100
 
0.4%
C 9
 
< 0.1%
D 6
 
< 0.1%
S 6
 
< 0.1%
Z 6
 
< 0.1%
U 5
 
< 0.1%
Other values (10) 19
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
e 4601
24.6%
s 4598
24.6%
o 4597
24.6%
t 4597
24.6%
m 147
 
0.8%
a 83
 
0.4%
p 72
 
0.4%
l 2
 
< 0.1%
r 2
 
< 0.1%
c 2
 
< 0.1%
Other values (6) 6
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 8952
17.3%
0 8355
16.1%
3 6022
11.6%
4 5451
10.5%
2 5128
9.9%
5 4090
7.9%
9 3811
7.3%
6 3503
 
6.8%
7 3393
 
6.5%
8 3164
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 10298
95.9%
; 436
 
4.1%
" 2
 
< 0.1%
? 1
 
< 0.1%
/ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
9228
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4595
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4593
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 176
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 81199
65.0%
Latin 43815
35.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 10211
23.3%
O 9191
21.0%
A 5092
11.6%
e 4601
10.5%
s 4598
10.5%
o 4597
10.5%
t 4597
10.5%
P 463
 
1.1%
m 147
 
0.3%
R 100
 
0.2%
Other values (26) 218
 
0.5%
Common
ValueCountFrequency (%)
. 10298
12.7%
9228
11.4%
1 8952
11.0%
0 8355
10.3%
3 6022
7.4%
4 5451
 
6.7%
2 5128
 
6.3%
( 4595
 
5.7%
) 4593
 
5.7%
5 4090
 
5.0%
Other values (9) 14487
17.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 125014
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 10298
 
8.2%
M 10211
 
8.2%
9228
 
7.4%
O 9191
 
7.4%
1 8952
 
7.2%
0 8355
 
6.7%
3 6022
 
4.8%
4 5451
 
4.4%
2 5128
 
4.1%
A 5092
 
4.1%
Other values (45) 47086
37.7%
Distinct18842
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:02.571002image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length654
Median length580
Mean length69.48706668
Min length13

Characters and Unicode

Total characters1310943
Distinct characters84
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18818 ?
Unique (%)99.7%

Sample

1st rowMAM number 17903; female; personal specimen number MFH 162; testes 5 x 2 mm
2nd rowMAM number 17889; female
3rd rowMAM number 17897; male
4th rowMAM number 17895; male
5th rowMAM number 17888; female
ValueCountFrequency (%)
number 29739
 
16.2%
mam 18873
 
10.3%
original 6652
 
3.6%
catalog 6652
 
3.6%
male 5021
 
2.7%
osteo 4618
 
2.5%
specimen 4419
 
2.4%
personal 4201
 
2.3%
female 4185
 
2.3%
accn=ypm.12236 2399
 
1.3%
Other values (25895) 96369
52.6%
2025-01-08T18:33:02.839383image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164262
 
12.5%
e 84610
 
6.5%
n 72932
 
5.6%
a 67392
 
5.1%
M 58097
 
4.4%
m 52492
 
4.0%
r 52119
 
4.0%
c 44063
 
3.4%
1 42870
 
3.3%
o 40086
 
3.1%
Other values (74) 632020
48.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 689220
52.6%
Decimal Number 220996
 
16.9%
Space Separator 164262
 
12.5%
Uppercase Letter 139813
 
10.7%
Other Punctuation 71758
 
5.5%
Math Symbol 13316
 
1.0%
Open Punctuation 5176
 
0.4%
Close Punctuation 5170
 
0.4%
Dash Punctuation 1232
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 84610
12.3%
n 72932
10.6%
a 67392
9.8%
m 52492
 
7.6%
r 52119
 
7.6%
c 44063
 
6.4%
o 40086
 
5.8%
l 39877
 
5.8%
u 37915
 
5.5%
b 35664
 
5.2%
Other values (16) 162070
23.5%
Uppercase Letter
ValueCountFrequency (%)
M 58097
41.6%
A 29360
21.0%
P 10732
 
7.7%
O 9348
 
6.7%
Y 8914
 
6.4%
V 3906
 
2.8%
Z 3819
 
2.7%
R 3460
 
2.5%
S 2466
 
1.8%
B 1899
 
1.4%
Other values (16) 7812
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 42870
19.4%
0 35267
16.0%
2 23803
10.8%
3 21594
9.8%
4 21563
9.8%
6 20089
9.1%
5 16601
 
7.5%
7 14226
 
6.4%
9 13052
 
5.9%
8 11931
 
5.4%
Other Punctuation
ValueCountFrequency (%)
; 39221
54.7%
. 27314
38.1%
, 4064
 
5.7%
: 788
 
1.1%
/ 142
 
0.2%
? 123
 
0.2%
" 56
 
0.1%
' 32
 
< 0.1%
& 18
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 13301
99.9%
+ 10
 
0.1%
± 3
 
< 0.1%
~ 1
 
< 0.1%
> 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 5170
99.9%
[ 5
 
0.1%
{ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 5164
99.9%
] 5
 
0.1%
} 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
164262
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1232
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 829033
63.2%
Common 481910
36.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 84610
 
10.2%
n 72932
 
8.8%
a 67392
 
8.1%
M 58097
 
7.0%
m 52492
 
6.3%
r 52119
 
6.3%
c 44063
 
5.3%
o 40086
 
4.8%
l 39877
 
4.8%
u 37915
 
4.6%
Other values (42) 279450
33.7%
Common
ValueCountFrequency (%)
164262
34.1%
1 42870
 
8.9%
; 39221
 
8.1%
0 35267
 
7.3%
. 27314
 
5.7%
2 23803
 
4.9%
3 21594
 
4.5%
4 21563
 
4.5%
6 20089
 
4.2%
5 16601
 
3.4%
Other values (22) 69326
14.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1310940
> 99.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164262
 
12.5%
e 84610
 
6.5%
n 72932
 
5.6%
a 67392
 
5.1%
M 58097
 
4.4%
m 52492
 
4.0%
r 52119
 
4.0%
c 44063
 
3.4%
1 42870
 
3.3%
o 40086
 
3.1%
Other values (73) 632017
48.2%
None
ValueCountFrequency (%)
± 3
100.0%
Distinct3180
Distinct (%)17.0%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:02.996967image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length135
Median length105
Mean length29.92679278
Min length3

Characters and Unicode

Total characters560050
Distinct characters55
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1569 ?
Unique (%)8.4%

Sample

1st rowTamias striatus fisheri
2nd rowPeromyscus leucopus noveboracensis
3rd rowPeromyscus leucopus noveboracensis
4th rowPeromyscus leucopus noveboracensis
5th rowPeromyscus leucopus noveboracensis
ValueCountFrequency (%)
peromyscus 1837
 
3.4%
gapperi 1530
 
2.8%
cinereus 1460
 
2.7%
brevicauda 1361
 
2.5%
sorex 1193
 
2.2%
blarina 976
 
1.8%
maniculatus 919
 
1.7%
zibethicus 906
 
1.7%
leucopus 836
 
1.6%
talpoides 759
 
1.4%
Other values (3630) 42002
78.1%
2025-01-08T18:33:03.233000image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 54787
 
9.8%
i 49194
 
8.8%
a 47829
 
8.5%
u 39741
 
7.1%
e 39634
 
7.1%
r 35507
 
6.3%
35065
 
6.3%
o 33086
 
5.9%
n 28079
 
5.0%
c 25884
 
4.6%
Other values (45) 171244
30.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 491072
87.7%
Space Separator 35065
 
6.3%
Uppercase Letter 26308
 
4.7%
Math Symbol 7591
 
1.4%
Other Punctuation 10
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 54787
11.2%
i 49194
10.0%
a 47829
9.7%
u 39741
 
8.1%
e 39634
 
8.1%
r 35507
 
7.2%
o 33086
 
6.7%
n 28079
 
5.7%
c 25884
 
5.3%
l 22447
 
4.6%
Other values (16) 114884
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 3888
14.8%
C 3681
14.0%
M 3187
12.1%
S 2683
10.2%
B 1796
 
6.8%
T 1608
 
6.1%
O 1561
 
5.9%
N 1123
 
4.3%
A 925
 
3.5%
L 925
 
3.5%
Other values (14) 4931
18.7%
Other Punctuation
ValueCountFrequency (%)
. 8
80.0%
? 2
 
20.0%
Space Separator
ValueCountFrequency (%)
35065
100.0%
Math Symbol
ValueCountFrequency (%)
| 7591
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 517380
92.4%
Common 42670
 
7.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 54787
 
10.6%
i 49194
 
9.5%
a 47829
 
9.2%
u 39741
 
7.7%
e 39634
 
7.7%
r 35507
 
6.9%
o 33086
 
6.4%
n 28079
 
5.4%
c 25884
 
5.0%
l 22447
 
4.3%
Other values (40) 141192
27.3%
Common
ValueCountFrequency (%)
35065
82.2%
| 7591
 
17.8%
. 8
 
< 0.1%
- 4
 
< 0.1%
? 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 560050
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 54787
 
9.8%
i 49194
 
8.8%
a 47829
 
8.5%
u 39741
 
7.1%
e 39634
 
7.1%
r 35507
 
6.3%
35065
 
6.3%
o 33086
 
5.9%
n 28079
 
5.0%
c 25884
 
4.6%
Other values (45) 171244
30.6%

fieldNumber
Text

Missing 

Distinct5159
Distinct (%)70.6%
Missing11555
Missing (%)61.2%
Memory size147.5 KiB
2025-01-08T18:33:03.406532image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length16
Mean length4.113664341
Min length1

Characters and Unicode

Total characters30075
Distinct characters68
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4249 ?
Unique (%)58.1%

Sample

1st row14251
2nd rowP5
3rd rowP14
4th rowP12
5th rowP4
ValueCountFrequency (%)
f 452
 
5.3%
r 169
 
2.0%
l 162
 
1.9%
mcz 50
 
0.6%
2 44
 
0.5%
3 43
 
0.5%
1 42
 
0.5%
5 38
 
0.4%
jas 32
 
0.4%
4 31
 
0.4%
Other values (4656) 7419
87.5%
2025-01-08T18:33:03.630277image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 4503
15.0%
3 2724
9.1%
4 2723
9.1%
2 2604
8.7%
0 2480
8.2%
8 2138
 
7.1%
9 1836
 
6.1%
7 1835
 
6.1%
5 1834
 
6.1%
6 1742
 
5.8%
Other values (58) 5656
18.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24419
81.2%
Uppercase Letter 3351
 
11.1%
Space Separator 1171
 
3.9%
Dash Punctuation 829
 
2.8%
Lowercase Letter 148
 
0.5%
Open Punctuation 53
 
0.2%
Close Punctuation 53
 
0.2%
Other Punctuation 51
 
0.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
F 853
25.5%
R 343
10.2%
Q 331
 
9.9%
A 229
 
6.8%
M 198
 
5.9%
L 178
 
5.3%
B 172
 
5.1%
Z 145
 
4.3%
C 133
 
4.0%
P 131
 
3.9%
Other values (16) 638
19.0%
Lowercase Letter
ValueCountFrequency (%)
a 24
16.2%
m 17
11.5%
l 16
10.8%
e 13
8.8%
o 12
 
8.1%
t 10
 
6.8%
r 8
 
5.4%
i 7
 
4.7%
n 6
 
4.1%
p 5
 
3.4%
Other values (10) 30
20.3%
Decimal Number
ValueCountFrequency (%)
1 4503
18.4%
3 2724
11.2%
4 2723
11.2%
2 2604
10.7%
0 2480
10.2%
8 2138
8.8%
9 1836
7.5%
7 1835
7.5%
5 1834
7.5%
6 1742
 
7.1%
Other Punctuation
ValueCountFrequency (%)
. 29
56.9%
? 9
 
17.6%
/ 7
 
13.7%
# 3
 
5.9%
; 2
 
3.9%
: 1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
[ 52
98.1%
( 1
 
1.9%
Close Punctuation
ValueCountFrequency (%)
] 52
98.1%
) 1
 
1.9%
Space Separator
ValueCountFrequency (%)
1171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 829
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26576
88.4%
Latin 3499
 
11.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
F 853
24.4%
R 343
9.8%
Q 331
 
9.5%
A 229
 
6.5%
M 198
 
5.7%
L 178
 
5.1%
B 172
 
4.9%
Z 145
 
4.1%
C 133
 
3.8%
P 131
 
3.7%
Other values (36) 786
22.5%
Common
ValueCountFrequency (%)
1 4503
16.9%
3 2724
10.2%
4 2723
10.2%
2 2604
9.8%
0 2480
9.3%
8 2138
8.0%
9 1836
6.9%
7 1835
6.9%
5 1834
6.9%
6 1742
 
6.6%
Other values (12) 2157
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30075
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 4503
15.0%
3 2724
9.1%
4 2723
9.1%
2 2604
8.7%
0 2480
8.2%
8 2138
 
7.1%
9 1836
 
6.1%
7 1835
 
6.1%
5 1834
 
6.1%
6 1742
 
5.8%
Other values (58) 5656
18.8%

eventDate
Text

Missing 

Distinct3828
Distinct (%)31.1%
Missing6567
Missing (%)34.8%
Memory size147.5 KiB
2025-01-08T18:33:03.828927image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length21
Median length10
Mean length9.544759737
Min length4

Characters and Unicode

Total characters117391
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2041 ?
Unique (%)16.6%

Sample

1st row2024-08-15
2nd row2023-12-01
3rd row2023-12-28
4th row2023-12-20
5th row2023-11-30
ValueCountFrequency (%)
2012-07-18 178
 
1.4%
2012-07-15 170
 
1.4%
1959 163
 
1.3%
2012-07-16 150
 
1.2%
2012-07-24 144
 
1.2%
2013-08-02 109
 
0.9%
2020-10-07 108
 
0.9%
2020-10-14 100
 
0.8%
2020-10-15 96
 
0.8%
2020-10-08 96
 
0.8%
Other values (3818) 10985
89.3%
2025-01-08T18:33:04.089663image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 22585
19.2%
0 21802
18.6%
1 20655
17.6%
2 12689
10.8%
9 10314
8.8%
7 5694
 
4.9%
6 5638
 
4.8%
5 5198
 
4.4%
3 4638
 
4.0%
8 4585
 
3.9%
Other values (2) 3593
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 94718
80.7%
Dash Punctuation 22585
 
19.2%
Other Punctuation 88
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 21802
23.0%
1 20655
21.8%
2 12689
13.4%
9 10314
10.9%
7 5694
 
6.0%
6 5638
 
6.0%
5 5198
 
5.5%
3 4638
 
4.9%
8 4585
 
4.8%
4 3505
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 22585
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 88
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 117391
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 22585
19.2%
0 21802
18.6%
1 20655
17.6%
2 12689
10.8%
9 10314
8.8%
7 5694
 
4.9%
6 5638
 
4.8%
5 5198
 
4.4%
3 4638
 
4.0%
8 4585
 
3.9%
Other values (2) 3593
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 117391
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 22585
19.2%
0 21802
18.6%
1 20655
17.6%
2 12689
10.8%
9 10314
8.8%
7 5694
 
4.9%
6 5638
 
4.8%
5 5198
 
4.4%
3 4638
 
4.0%
8 4585
 
3.9%
Other values (2) 3593
 
3.1%

startDayOfYear
Text

Missing 

Distinct366
Distinct (%)3.3%
Missing7901
Missing (%)41.9%
Memory size147.5 KiB
2025-01-08T18:33:04.290178image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.816598267
Min length1

Characters and Unicode

Total characters30884
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row228
2nd row335
3rd row362
4th row354
5th row334
ValueCountFrequency (%)
200 253
 
2.3%
197 230
 
2.1%
198 219
 
2.0%
206 207
 
1.9%
214 178
 
1.6%
190 131
 
1.2%
194 123
 
1.1%
281 121
 
1.1%
282 113
 
1.0%
288 113
 
1.0%
Other values (356) 9277
84.6%
2025-01-08T18:33:04.550830image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 6647
21.5%
1 5594
18.1%
3 3147
10.2%
9 2617
 
8.5%
0 2608
 
8.4%
8 2580
 
8.4%
7 2170
 
7.0%
4 1963
 
6.4%
5 1825
 
5.9%
6 1733
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30884
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 6647
21.5%
1 5594
18.1%
3 3147
10.2%
9 2617
 
8.5%
0 2608
 
8.4%
8 2580
 
8.4%
7 2170
 
7.0%
4 1963
 
6.4%
5 1825
 
5.9%
6 1733
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
Common 30884
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 6647
21.5%
1 5594
18.1%
3 3147
10.2%
9 2617
 
8.5%
0 2608
 
8.4%
8 2580
 
8.4%
7 2170
 
7.0%
4 1963
 
6.4%
5 1825
 
5.9%
6 1733
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30884
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 6647
21.5%
1 5594
18.1%
3 3147
10.2%
9 2617
 
8.5%
0 2608
 
8.4%
8 2580
 
8.4%
7 2170
 
7.0%
4 1963
 
6.4%
5 1825
 
5.9%
6 1733
 
5.6%

endDayOfYear
Text

Missing 

Distinct366
Distinct (%)3.3%
Missing7901
Missing (%)41.9%
Memory size147.5 KiB
2025-01-08T18:33:04.747783image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.816415869
Min length1

Characters and Unicode

Total characters30882
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row228
2nd row335
3rd row362
4th row354
5th row334
ValueCountFrequency (%)
200 253
 
2.3%
197 230
 
2.1%
198 219
 
2.0%
206 207
 
1.9%
214 178
 
1.6%
190 131
 
1.2%
194 123
 
1.1%
281 121
 
1.1%
282 113
 
1.0%
288 113
 
1.0%
Other values (356) 9277
84.6%
2025-01-08T18:33:05.000433image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 6646
21.5%
1 5597
18.1%
3 3140
10.2%
9 2609
 
8.4%
0 2608
 
8.4%
8 2544
 
8.2%
7 2202
 
7.1%
4 1921
 
6.2%
5 1856
 
6.0%
6 1759
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30882
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 6646
21.5%
1 5597
18.1%
3 3140
10.2%
9 2609
 
8.4%
0 2608
 
8.4%
8 2544
 
8.2%
7 2202
 
7.1%
4 1921
 
6.2%
5 1856
 
6.0%
6 1759
 
5.7%

Most occurring scripts

ValueCountFrequency (%)
Common 30882
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 6646
21.5%
1 5597
18.1%
3 3140
10.2%
9 2609
 
8.4%
0 2608
 
8.4%
8 2544
 
8.2%
7 2202
 
7.1%
4 1921
 
6.2%
5 1856
 
6.0%
6 1759
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30882
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 6646
21.5%
1 5597
18.1%
3 3140
10.2%
9 2609
 
8.4%
0 2608
 
8.4%
8 2544
 
8.2%
7 2202
 
7.1%
4 1921
 
6.2%
5 1856
 
6.0%
6 1759
 
5.7%

year
Text

Missing 

Distinct156
Distinct (%)1.3%
Missing6572
Missing (%)34.8%
Memory size147.5 KiB
2025-01-08T18:33:05.151160image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters49176
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)0.1%

Sample

1st row2024
2nd row2023
3rd row2023
4th row2023
5th row2023
ValueCountFrequency (%)
2013 864
 
7.0%
2012 821
 
6.7%
2020 800
 
6.5%
2014 728
 
5.9%
1965 712
 
5.8%
1962 340
 
2.8%
1956 325
 
2.6%
1964 288
 
2.3%
1959 284
 
2.3%
1952 274
 
2.2%
Other values (146) 6858
55.8%
2025-01-08T18:33:05.351192image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 11749
23.9%
9 8330
16.9%
2 7669
15.6%
0 6637
13.5%
5 3388
 
6.9%
6 3295
 
6.7%
3 2758
 
5.6%
7 1878
 
3.8%
4 1818
 
3.7%
8 1654
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 49176
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 11749
23.9%
9 8330
16.9%
2 7669
15.6%
0 6637
13.5%
5 3388
 
6.9%
6 3295
 
6.7%
3 2758
 
5.6%
7 1878
 
3.8%
4 1818
 
3.7%
8 1654
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
Common 49176
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 11749
23.9%
9 8330
16.9%
2 7669
15.6%
0 6637
13.5%
5 3388
 
6.9%
6 3295
 
6.7%
3 2758
 
5.6%
7 1878
 
3.8%
4 1818
 
3.7%
8 1654
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 11749
23.9%
9 8330
16.9%
2 7669
15.6%
0 6637
13.5%
5 3388
 
6.9%
6 3295
 
6.7%
3 2758
 
5.6%
7 1878
 
3.8%
4 1818
 
3.7%
8 1654
 
3.4%

month
Text

Missing 

Distinct12
Distinct (%)0.1%
Missing7472
Missing (%)39.6%
Memory size147.5 KiB
2025-01-08T18:33:05.411484image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length1
Mean length1.204318062
Min length1

Characters and Unicode

Total characters13722
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row8
2nd row12
3rd row12
4th row12
5th row11
ValueCountFrequency (%)
7 2621
23.0%
8 1678
14.7%
10 1318
11.6%
6 1172
10.3%
9 828
 
7.3%
1 718
 
6.3%
11 605
 
5.3%
5 553
 
4.9%
3 508
 
4.5%
4 496
 
4.4%
Other values (2) 897
 
7.9%
2025-01-08T18:33:05.510322image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3651
26.6%
7 2621
19.1%
8 1678
12.2%
0 1318
 
9.6%
6 1172
 
8.5%
2 897
 
6.5%
9 828
 
6.0%
5 553
 
4.0%
3 508
 
3.7%
4 496
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13722
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3651
26.6%
7 2621
19.1%
8 1678
12.2%
0 1318
 
9.6%
6 1172
 
8.5%
2 897
 
6.5%
9 828
 
6.0%
5 553
 
4.0%
3 508
 
3.7%
4 496
 
3.6%

Most occurring scripts

ValueCountFrequency (%)
Common 13722
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3651
26.6%
7 2621
19.1%
8 1678
12.2%
0 1318
 
9.6%
6 1172
 
8.5%
2 897
 
6.5%
9 828
 
6.0%
5 553
 
4.0%
3 508
 
3.7%
4 496
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13722
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3651
26.6%
7 2621
19.1%
8 1678
12.2%
0 1318
 
9.6%
6 1172
 
8.5%
2 897
 
6.5%
9 828
 
6.0%
5 553
 
4.0%
3 508
 
3.7%
4 496
 
3.6%

day
Text

Missing 

Distinct31
Distinct (%)0.3%
Missing7989
Missing (%)42.3%
Memory size147.5 KiB
2025-01-08T18:33:05.576609image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length1.67812816
Min length1

Characters and Unicode

Total characters18253
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15
2nd row1
3rd row28
4th row20
5th row30
ValueCountFrequency (%)
18 551
 
5.1%
15 518
 
4.8%
7 470
 
4.3%
16 465
 
4.3%
8 445
 
4.1%
9 433
 
4.0%
24 428
 
3.9%
2 425
 
3.9%
19 410
 
3.8%
4 386
 
3.5%
Other values (21) 6346
58.3%
2025-01-08T18:33:05.693012image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5053
27.7%
2 3974
21.8%
3 1359
 
7.4%
8 1236
 
6.8%
4 1179
 
6.5%
5 1143
 
6.3%
7 1122
 
6.1%
6 1111
 
6.1%
9 1085
 
5.9%
0 991
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18253
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5053
27.7%
2 3974
21.8%
3 1359
 
7.4%
8 1236
 
6.8%
4 1179
 
6.5%
5 1143
 
6.3%
7 1122
 
6.1%
6 1111
 
6.1%
9 1085
 
5.9%
0 991
 
5.4%

Most occurring scripts

ValueCountFrequency (%)
Common 18253
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5053
27.7%
2 3974
21.8%
3 1359
 
7.4%
8 1236
 
6.8%
4 1179
 
6.5%
5 1143
 
6.3%
7 1122
 
6.1%
6 1111
 
6.1%
9 1085
 
5.9%
0 991
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18253
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5053
27.7%
2 3974
21.8%
3 1359
 
7.4%
8 1236
 
6.8%
4 1179
 
6.5%
5 1143
 
6.3%
7 1122
 
6.1%
6 1111
 
6.1%
9 1085
 
5.9%
0 991
 
5.4%

habitat
Text

Missing 

Distinct49
Distinct (%)38.6%
Missing18739
Missing (%)99.3%
Memory size147.5 KiB
2025-01-08T18:33:05.838007image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length185
Median length88
Mean length16.97637795
Min length5

Characters and Unicode

Total characters2156
Distinct characters46
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)29.9%

Sample

1st rowUrban
2nd rowUrban
3rd rowUrban
4th rowUrban
5th rowUrban
ValueCountFrequency (%)
urban 50
 
14.2%
in 21
 
5.9%
suburban 18
 
5.1%
forest 10
 
2.8%
by 8
 
2.3%
pine 7
 
2.0%
open 6
 
1.7%
of 6
 
1.7%
ponderosa 6
 
1.7%
soil 5
 
1.4%
Other values (132) 216
61.2%
2025-01-08T18:33:06.060771image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
226
 
10.5%
a 205
 
9.5%
n 189
 
8.8%
r 178
 
8.3%
e 162
 
7.5%
o 131
 
6.1%
s 119
 
5.5%
b 116
 
5.4%
i 105
 
4.9%
t 98
 
4.5%
Other values (36) 627
29.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1794
83.2%
Space Separator 226
 
10.5%
Uppercase Letter 100
 
4.6%
Other Punctuation 31
 
1.4%
Decimal Number 3
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 205
11.4%
n 189
10.5%
r 178
9.9%
e 162
 
9.0%
o 131
 
7.3%
s 119
 
6.6%
b 116
 
6.5%
i 105
 
5.9%
t 98
 
5.5%
d 81
 
4.5%
Other values (14) 410
22.9%
Uppercase Letter
ValueCountFrequency (%)
U 50
50.0%
S 19
 
19.0%
P 11
 
11.0%
W 5
 
5.0%
R 3
 
3.0%
B 3
 
3.0%
C 3
 
3.0%
E 2
 
2.0%
V 1
 
1.0%
F 1
 
1.0%
Other values (2) 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 20
64.5%
. 4
 
12.9%
; 3
 
9.7%
" 2
 
6.5%
: 1
 
3.2%
' 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
0 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
226
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1894
87.8%
Common 262
 
12.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 205
10.8%
n 189
 
10.0%
r 178
 
9.4%
e 162
 
8.6%
o 131
 
6.9%
s 119
 
6.3%
b 116
 
6.1%
i 105
 
5.5%
t 98
 
5.2%
d 81
 
4.3%
Other values (26) 510
26.9%
Common
ValueCountFrequency (%)
226
86.3%
, 20
 
7.6%
. 4
 
1.5%
; 3
 
1.1%
" 2
 
0.8%
- 2
 
0.8%
0 2
 
0.8%
: 1
 
0.4%
1 1
 
0.4%
' 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2156
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
226
 
10.5%
a 205
 
9.5%
n 189
 
8.8%
r 178
 
8.3%
e 162
 
7.5%
o 131
 
6.1%
s 119
 
5.5%
b 116
 
5.4%
i 105
 
4.9%
t 98
 
4.5%
Other values (36) 627
29.1%

higherGeography
Text

Missing 

Distinct951
Distinct (%)6.3%
Missing3778
Missing (%)20.0%
Memory size147.5 KiB
2025-01-08T18:33:06.249038image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length74
Median length66
Mean length40.53313892
Min length4

Characters and Unicode

Total characters611564
Distinct characters63
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)2.1%

Sample

1st rowNorth America; USA; Connecticut; New Haven County
2nd rowNorth America; USA; Connecticut; Middlesex County
3rd rowNorth America; USA; Connecticut; Middlesex County
4th rowNorth America; USA; Connecticut; Middlesex County
5th rowNorth America; USA; Connecticut; Middlesex County
ValueCountFrequency (%)
america 11919
14.1%
north 11535
13.6%
usa 10091
 
11.9%
county 9449
 
11.1%
new 4323
 
5.1%
hampshire 2881
 
3.4%
carroll 2750
 
3.2%
africa 2011
 
2.4%
connecticut 1497
 
1.8%
province 1319
 
1.6%
Other values (974) 27017
31.9%
2025-01-08T18:33:06.517902image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
69704
 
11.4%
r 45359
 
7.4%
a 42864
 
7.0%
o 40987
 
6.7%
; 38761
 
6.3%
e 35621
 
5.8%
i 34064
 
5.6%
t 32303
 
5.3%
n 28583
 
4.7%
A 26232
 
4.3%
Other values (53) 217086
35.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 397956
65.1%
Uppercase Letter 105028
 
17.2%
Space Separator 69704
 
11.4%
Other Punctuation 38819
 
6.3%
Dash Punctuation 57
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 45359
11.4%
a 42864
10.8%
o 40987
10.3%
e 35621
9.0%
i 34064
8.6%
t 32303
 
8.1%
n 28583
 
7.2%
c 22577
 
5.7%
h 17759
 
4.5%
m 16806
 
4.2%
Other values (20) 81033
20.4%
Uppercase Letter
ValueCountFrequency (%)
A 26232
25.0%
C 17618
16.8%
N 16467
15.7%
S 12399
11.8%
U 10244
 
9.8%
H 3948
 
3.8%
M 2894
 
2.8%
P 2510
 
2.4%
E 1256
 
1.2%
G 1207
 
1.1%
Other values (16) 10253
 
9.8%
Other Punctuation
ValueCountFrequency (%)
; 38761
99.9%
. 28
 
0.1%
' 28
 
0.1%
& 1
 
< 0.1%
, 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
69704
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 502984
82.2%
Common 108580
 
17.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 45359
 
9.0%
a 42864
 
8.5%
o 40987
 
8.1%
e 35621
 
7.1%
i 34064
 
6.8%
t 32303
 
6.4%
n 28583
 
5.7%
A 26232
 
5.2%
c 22577
 
4.5%
h 17759
 
3.5%
Other values (46) 176635
35.1%
Common
ValueCountFrequency (%)
69704
64.2%
; 38761
35.7%
- 57
 
0.1%
. 28
 
< 0.1%
' 28
 
< 0.1%
& 1
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 611460
> 99.9%
None 104
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
69704
 
11.4%
r 45359
 
7.4%
a 42864
 
7.0%
o 40987
 
6.7%
; 38761
 
6.3%
e 35621
 
5.8%
i 34064
 
5.6%
t 32303
 
5.3%
n 28583
 
4.7%
A 26232
 
4.3%
Other values (48) 216982
35.5%
None
ValueCountFrequency (%)
á 72
69.2%
í 16
 
15.4%
é 11
 
10.6%
ó 4
 
3.8%
Á 1
 
1.0%

continent
Text

Missing 

Distinct7
Distinct (%)< 0.1%
Missing3874
Missing (%)20.5%
Memory size147.5 KiB
2025-01-08T18:33:06.582848image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length11.49086179
Min length4

Characters and Unicode

Total characters172271
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
3rd rowNORTH_AMERICA
4th rowNORTH_AMERICA
5th rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 11386
75.9%
africa 1991
 
13.3%
asia 648
 
4.3%
south_america 537
 
3.6%
europe 279
 
1.9%
oceania 150
 
1.0%
antarctica 1
 
< 0.1%
2025-01-08T18:33:06.684276image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 29427
17.1%
R 25580
14.8%
I 14713
8.5%
C 14066
8.2%
E 12631
7.3%
O 12352
7.2%
T 11925
6.9%
H 11923
6.9%
_ 11923
6.9%
M 11923
6.9%
Other values (5) 15808
9.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 160348
93.1%
Connector Punctuation 11923
 
6.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 29427
18.4%
R 25580
16.0%
I 14713
9.2%
C 14066
8.8%
E 12631
7.9%
O 12352
7.7%
T 11925
7.4%
H 11923
7.4%
M 11923
7.4%
N 11537
 
7.2%
Other values (4) 4271
 
2.7%
Connector Punctuation
ValueCountFrequency (%)
_ 11923
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 160348
93.1%
Common 11923
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 29427
18.4%
R 25580
16.0%
I 14713
9.2%
C 14066
8.8%
E 12631
7.9%
O 12352
7.7%
T 11925
7.4%
H 11923
7.4%
M 11923
7.4%
N 11537
 
7.2%
Other values (4) 4271
 
2.7%
Common
ValueCountFrequency (%)
_ 11923
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 172271
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 29427
17.1%
R 25580
14.8%
I 14713
8.5%
C 14066
8.2%
E 12631
7.3%
O 12352
7.2%
T 11925
6.9%
H 11923
6.9%
_ 11923
6.9%
M 11923
6.9%
Other values (5) 15808
9.2%

waterBody
Text

Missing 

Distinct7
Distinct (%)5.5%
Missing18739
Missing (%)99.3%
Memory size147.5 KiB
2025-01-08T18:33:06.733029image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length38
Median length29
Mean length23.07874016
Min length12

Characters and Unicode

Total characters2931
Distinct characters25
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)2.4%

Sample

1st rowAtlantic Ocean; Caribbean Sea
2nd rowAtlantic Ocean; Caribbean Sea
3rd rowAtlantic Ocean; Caribbean Sea
4th rowAtlantic Ocean; Caribbean Sea
5th rowAtlantic Ocean; Caribbean Sea
ValueCountFrequency (%)
ocean 127
30.5%
atlantic 87
20.9%
sea 79
19.0%
caribbean 78
18.8%
pacific 30
 
7.2%
indian 9
 
2.2%
arctic 1
 
0.2%
red 1
 
0.2%
gulf 1
 
0.2%
of 1
 
0.2%
Other values (2) 2
 
0.5%
2025-01-08T18:33:06.833389image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 490
16.7%
n 312
10.6%
289
9.9%
e 287
9.8%
c 277
9.5%
i 236
8.1%
t 176
 
6.0%
b 156
 
5.3%
O 127
 
4.3%
A 88
 
3.0%
Other values (15) 493
16.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2147
73.3%
Uppercase Letter 415
 
14.2%
Space Separator 289
 
9.9%
Other Punctuation 80
 
2.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 490
22.8%
n 312
14.5%
e 287
13.4%
c 277
12.9%
i 236
11.0%
t 176
 
8.2%
b 156
 
7.3%
l 88
 
4.1%
r 80
 
3.7%
f 32
 
1.5%
Other values (4) 13
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
O 127
30.6%
A 88
21.2%
S 80
19.3%
C 78
18.8%
P 30
 
7.2%
I 9
 
2.2%
R 1
 
0.2%
G 1
 
0.2%
L 1
 
0.2%
Space Separator
ValueCountFrequency (%)
289
100.0%
Other Punctuation
ValueCountFrequency (%)
; 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2562
87.4%
Common 369
 
12.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 490
19.1%
n 312
12.2%
e 287
11.2%
c 277
10.8%
i 236
9.2%
t 176
 
6.9%
b 156
 
6.1%
O 127
 
5.0%
A 88
 
3.4%
l 88
 
3.4%
Other values (13) 325
12.7%
Common
ValueCountFrequency (%)
289
78.3%
; 80
 
21.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2931
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 490
16.7%
n 312
10.6%
289
9.9%
e 287
9.8%
c 277
9.5%
i 236
8.1%
t 176
 
6.0%
b 156
 
5.3%
O 127
 
4.3%
A 88
 
3.0%
Other values (15) 493
16.8%

countryCode
Text

Missing 

Distinct105
Distinct (%)0.7%
Missing3974
Missing (%)21.1%
Memory size147.5 KiB
2025-01-08T18:33:06.922394image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters29784
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)0.1%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS
ValueCountFrequency (%)
us 10088
67.7%
ca 686
 
4.6%
ke 667
 
4.5%
mx 578
 
3.9%
eg 430
 
2.9%
id 279
 
1.9%
cm 254
 
1.7%
ec 238
 
1.6%
gr 138
 
0.9%
au 112
 
0.8%
Other values (95) 1422
 
9.5%
2025-01-08T18:33:07.066793image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
U 10251
34.4%
S 10220
34.3%
E 1412
 
4.7%
C 1380
 
4.6%
M 1059
 
3.6%
A 911
 
3.1%
G 811
 
2.7%
K 743
 
2.5%
X 578
 
1.9%
I 432
 
1.5%
Other values (16) 1987
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 29784
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 10251
34.4%
S 10220
34.3%
E 1412
 
4.7%
C 1380
 
4.6%
M 1059
 
3.6%
A 911
 
3.1%
G 811
 
2.7%
K 743
 
2.5%
X 578
 
1.9%
I 432
 
1.5%
Other values (16) 1987
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 29784
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 10251
34.4%
S 10220
34.3%
E 1412
 
4.7%
C 1380
 
4.6%
M 1059
 
3.6%
A 911
 
3.1%
G 811
 
2.7%
K 743
 
2.5%
X 578
 
1.9%
I 432
 
1.5%
Other values (16) 1987
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29784
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
U 10251
34.4%
S 10220
34.3%
E 1412
 
4.7%
C 1380
 
4.6%
M 1059
 
3.6%
A 911
 
3.1%
G 811
 
2.7%
K 743
 
2.5%
X 578
 
1.9%
I 432
 
1.5%
Other values (16) 1987
 
6.7%

stateProvince
Text

Missing 

Distinct260
Distinct (%)1.9%
Missing5347
Missing (%)28.3%
Memory size147.5 KiB
2025-01-08T18:33:07.238484image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length25
Mean length11.24032843
Min length3

Characters and Unicode

Total characters151958
Distinct characters58
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)0.5%

Sample

1st rowConnecticut
2nd rowConnecticut
3rd rowConnecticut
4th rowConnecticut
5th rowConnecticut
ValueCountFrequency (%)
new 3586
17.2%
hampshire 2877
 
13.8%
connecticut 1497
 
7.2%
province 1288
 
6.2%
state 613
 
2.9%
minnesota 580
 
2.8%
york 506
 
2.4%
colorado 463
 
2.2%
arizona 438
 
2.1%
wisconsin 425
 
2.0%
Other values (287) 8636
41.3%
2025-01-08T18:33:07.479464image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 14301
 
9.4%
a 13747
 
9.0%
i 12892
 
8.5%
n 10792
 
7.1%
o 10626
 
7.0%
r 9355
 
6.2%
t 8029
 
5.3%
s 7835
 
5.2%
7390
 
4.9%
c 5643
 
3.7%
Other values (48) 51348
33.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 123691
81.4%
Uppercase Letter 20861
 
13.7%
Space Separator 7390
 
4.9%
Dash Punctuation 15
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 14301
11.6%
a 13747
11.1%
i 12892
10.4%
n 10792
8.7%
o 10626
8.6%
r 9355
 
7.6%
t 8029
 
6.5%
s 7835
 
6.3%
c 5643
 
4.6%
h 4465
 
3.6%
Other values (20) 26006
21.0%
Uppercase Letter
ValueCountFrequency (%)
N 4092
19.6%
C 3272
15.7%
H 2932
14.1%
P 1777
8.5%
M 1528
 
7.3%
A 1144
 
5.5%
S 846
 
4.1%
W 788
 
3.8%
V 591
 
2.8%
Y 507
 
2.4%
Other values (15) 3384
16.2%
Space Separator
ValueCountFrequency (%)
7390
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 144552
95.1%
Common 7406
 
4.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 14301
 
9.9%
a 13747
 
9.5%
i 12892
 
8.9%
n 10792
 
7.5%
o 10626
 
7.4%
r 9355
 
6.5%
t 8029
 
5.6%
s 7835
 
5.4%
c 5643
 
3.9%
h 4465
 
3.1%
Other values (45) 46867
32.4%
Common
ValueCountFrequency (%)
7390
99.8%
- 15
 
0.2%
' 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 151865
99.9%
None 93
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 14301
 
9.4%
a 13747
 
9.1%
i 12892
 
8.5%
n 10792
 
7.1%
o 10626
 
7.0%
r 9355
 
6.2%
t 8029
 
5.3%
s 7835
 
5.2%
7390
 
4.9%
c 5643
 
3.7%
Other values (44) 51255
33.8%
None
ValueCountFrequency (%)
á 72
77.4%
í 16
 
17.2%
ó 3
 
3.2%
é 2
 
2.2%

county
Text

Missing 

Distinct484
Distinct (%)5.0%
Missing9192
Missing (%)48.7%
Memory size147.5 KiB
2025-01-08T18:33:07.663216image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length28
Median length27
Mean length14.43198263
Min length6

Characters and Unicode

Total characters139615
Distinct characters57
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)1.6%

Sample

1st rowNew Haven County
2nd rowMiddlesex County
3rd rowMiddlesex County
4th rowMiddlesex County
5th rowMiddlesex County
ValueCountFrequency (%)
county 9433
45.6%
carroll 2750
 
13.3%
new 705
 
3.4%
haven 655
 
3.2%
cass 356
 
1.7%
litchfield 334
 
1.6%
gunnison 275
 
1.3%
fairfield 220
 
1.1%
iron 203
 
1.0%
middlesex 167
 
0.8%
Other values (517) 5606
27.1%
2025-01-08T18:33:07.908379image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 15785
11.3%
n 14029
10.0%
C 13107
9.4%
t 11232
 
8.0%
11030
 
7.9%
u 10798
 
7.7%
y 9773
 
7.0%
r 8601
 
6.2%
l 7999
 
5.7%
a 7541
 
5.4%
Other values (47) 29720
21.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 107635
77.1%
Uppercase Letter 20854
 
14.9%
Space Separator 11030
 
7.9%
Other Punctuation 55
 
< 0.1%
Dash Punctuation 41
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 15785
14.7%
n 14029
13.0%
t 11232
10.4%
u 10798
10.0%
y 9773
9.1%
r 8601
8.0%
l 7999
7.4%
a 7541
7.0%
e 5545
 
5.2%
i 3786
 
3.5%
Other values (18) 12546
11.7%
Uppercase Letter
ValueCountFrequency (%)
C 13107
62.9%
H 924
 
4.4%
L 867
 
4.2%
N 832
 
4.0%
S 670
 
3.2%
M 609
 
2.9%
F 535
 
2.6%
G 503
 
2.4%
P 490
 
2.3%
B 456
 
2.2%
Other values (15) 1861
 
8.9%
Other Punctuation
ValueCountFrequency (%)
. 28
50.9%
' 27
49.1%
Space Separator
ValueCountFrequency (%)
11030
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 128489
92.0%
Common 11126
 
8.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 15785
12.3%
n 14029
10.9%
C 13107
10.2%
t 11232
8.7%
u 10798
8.4%
y 9773
7.6%
r 8601
 
6.7%
l 7999
 
6.2%
a 7541
 
5.9%
e 5545
 
4.3%
Other values (43) 24079
18.7%
Common
ValueCountFrequency (%)
11030
99.1%
- 41
 
0.4%
. 28
 
0.3%
' 27
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 139604
> 99.9%
None 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 15785
11.3%
n 14029
10.0%
C 13107
9.4%
t 11232
 
8.0%
11030
 
7.9%
u 10798
 
7.7%
y 9773
 
7.0%
r 8601
 
6.2%
l 7999
 
5.7%
a 7541
 
5.4%
Other values (44) 29709
21.3%
None
ValueCountFrequency (%)
é 9
81.8%
Á 1
 
9.1%
ó 1
 
9.1%

municipality
Text

Missing 

Distinct93
Distinct (%)16.7%
Missing18309
Missing (%)97.0%
Memory size147.5 KiB
2025-01-08T18:33:08.007543image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length25
Median length19
Mean length8.47935368
Min length4

Characters and Unicode

Total characters4723
Distinct characters49
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)6.6%

Sample

1st rowRedding
2nd rowHamden
3rd rowHamden
4th rowPerkasie
5th rowPhiladelphia
ValueCountFrequency (%)
parksville 56
 
8.5%
fairfield 39
 
5.9%
westport 35
 
5.3%
kent 32
 
4.9%
norwalk 29
 
4.4%
lloyd 27
 
4.1%
harbor 27
 
4.1%
new 25
 
3.8%
quince 24
 
3.6%
mil 24
 
3.6%
Other values (98) 340
51.7%
2025-01-08T18:33:08.156738image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 421
 
8.9%
e 410
 
8.7%
a 396
 
8.4%
r 359
 
7.6%
i 356
 
7.5%
o 282
 
6.0%
n 258
 
5.5%
t 205
 
4.3%
s 184
 
3.9%
d 163
 
3.5%
Other values (39) 1689
35.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3961
83.9%
Uppercase Letter 658
 
13.9%
Space Separator 101
 
2.1%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 421
10.6%
e 410
10.4%
a 396
10.0%
r 359
 
9.1%
i 356
 
9.0%
o 282
 
7.1%
n 258
 
6.5%
t 205
 
5.2%
s 184
 
4.6%
d 163
 
4.1%
Other values (13) 927
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 108
16.4%
N 59
9.0%
W 58
 
8.8%
M 55
 
8.4%
H 49
 
7.4%
F 43
 
6.5%
L 41
 
6.2%
K 36
 
5.5%
B 35
 
5.3%
Q 28
 
4.3%
Other values (12) 146
22.2%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4619
97.8%
Common 104
 
2.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 421
 
9.1%
e 410
 
8.9%
a 396
 
8.6%
r 359
 
7.8%
i 356
 
7.7%
o 282
 
6.1%
n 258
 
5.6%
t 205
 
4.4%
s 184
 
4.0%
d 163
 
3.5%
Other values (35) 1585
34.3%
Common
ValueCountFrequency (%)
101
97.1%
, 1
 
1.0%
- 1
 
1.0%
& 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4723
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 421
 
8.9%
e 410
 
8.7%
a 396
 
8.4%
r 359
 
7.6%
i 356
 
7.5%
o 282
 
6.0%
n 258
 
5.5%
t 205
 
4.3%
s 184
 
3.9%
d 163
 
3.5%
Other values (39) 1689
35.8%

locality
Text

Missing 

Distinct2520
Distinct (%)19.4%
Missing5869
Missing (%)31.1%
Memory size147.5 KiB
2025-01-08T18:33:08.332310image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length136
Median length96
Mean length26.34000154
Min length3

Characters and Unicode

Total characters342341
Distinct characters86
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1275 ?
Unique (%)9.8%

Sample

1st rowNew Haven. Yale University, Peabody Museum
2nd rowClinton. 245 Killingworth Turnpike
3rd rowClinton. 245 Killingworth Turnpike
4th rowClinton. 245 Killingworth Turnpike
5th rowClinton. 245 Killingworth Turnpike
ValueCountFrequency (%)
forest 3560
 
6.6%
experimental 2766
 
5.1%
bartlett 2744
 
5.1%
of 2288
 
4.2%
comp 1856
 
3.4%
miles 929
 
1.7%
transect 736
 
1.4%
mi 727
 
1.3%
national 657
 
1.2%
island 533
 
1.0%
Other values (3204) 37538
69.1%
2025-01-08T18:33:08.593966image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41355
 
12.1%
e 29260
 
8.5%
a 26473
 
7.7%
t 25066
 
7.3%
o 21028
 
6.1%
r 19414
 
5.7%
n 17045
 
5.0%
i 15970
 
4.7%
l 15843
 
4.6%
s 12163
 
3.6%
Other values (76) 118724
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 240976
70.4%
Space Separator 41355
 
12.1%
Uppercase Letter 38660
 
11.3%
Decimal Number 10790
 
3.2%
Other Punctuation 9066
 
2.6%
Dash Punctuation 863
 
0.3%
Open Punctuation 261
 
0.1%
Close Punctuation 261
 
0.1%
Math Symbol 109
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 29260
12.1%
a 26473
11.0%
t 25066
10.4%
o 21028
8.7%
r 19414
8.1%
n 17045
 
7.1%
i 15970
 
6.6%
l 15843
 
6.6%
s 12163
 
5.0%
m 10319
 
4.3%
Other values (20) 48395
20.1%
Uppercase Letter
ValueCountFrequency (%)
F 4122
10.7%
C 4107
10.6%
B 4072
10.5%
E 3542
 
9.2%
S 2650
 
6.9%
M 2630
 
6.8%
N 2411
 
6.2%
R 2164
 
5.6%
P 1557
 
4.0%
A 1375
 
3.6%
Other values (16) 10030
25.9%
Other Punctuation
ValueCountFrequency (%)
. 6402
70.6%
, 1832
 
20.2%
/ 501
 
5.5%
' 120
 
1.3%
? 66
 
0.7%
; 48
 
0.5%
" 40
 
0.4%
& 31
 
0.3%
: 22
 
0.2%
# 4
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 2029
18.8%
5 1908
17.7%
4 1565
14.5%
0 1214
11.3%
3 942
8.7%
6 915
8.5%
2 913
8.5%
7 511
 
4.7%
8 475
 
4.4%
9 318
 
2.9%
Close Punctuation
ValueCountFrequency (%)
] 212
81.2%
) 48
 
18.4%
} 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
= 105
96.3%
~ 2
 
1.8%
+ 2
 
1.8%
Open Punctuation
ValueCountFrequency (%)
[ 213
81.6%
( 48
 
18.4%
Space Separator
ValueCountFrequency (%)
41355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 863
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 279636
81.7%
Common 62705
 
18.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 29260
 
10.5%
a 26473
 
9.5%
t 25066
 
9.0%
o 21028
 
7.5%
r 19414
 
6.9%
n 17045
 
6.1%
i 15970
 
5.7%
l 15843
 
5.7%
s 12163
 
4.3%
m 10319
 
3.7%
Other values (46) 87055
31.1%
Common
ValueCountFrequency (%)
41355
66.0%
. 6402
 
10.2%
1 2029
 
3.2%
5 1908
 
3.0%
, 1832
 
2.9%
4 1565
 
2.5%
0 1214
 
1.9%
3 942
 
1.5%
6 915
 
1.5%
2 913
 
1.5%
Other values (20) 3630
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 342324
> 99.9%
None 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41355
 
12.1%
e 29260
 
8.5%
a 26473
 
7.7%
t 25066
 
7.3%
o 21028
 
6.1%
r 19414
 
5.7%
n 17045
 
5.0%
i 15970
 
4.7%
l 15843
 
4.6%
s 12163
 
3.6%
Other values (72) 118707
34.7%
None
ValueCountFrequency (%)
í 8
47.1%
ç 4
23.5%
ö 4
23.5%
á 1
 
5.9%

verbatimElevation
Text

Missing 

Distinct195
Distinct (%)13.2%
Missing17391
Missing (%)92.2%
Memory size147.5 KiB
2025-01-08T18:33:08.799075image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.446779661
Min length4

Characters and Unicode

Total characters12459
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)3.9%

Sample

1st row200-200 ft
2nd row200-200 ft
3rd row638 m
4th row638 m
5th row1143 m
ValueCountFrequency (%)
m 858
29.1%
ft 617
20.9%
200-200 104
 
3.5%
1829 84
 
2.8%
700 58
 
2.0%
638 56
 
1.9%
2134 56
 
1.9%
6000-6000 40
 
1.4%
500 39
 
1.3%
2896 33
 
1.1%
Other values (172) 1005
34.1%
2025-01-08T18:33:08.962189image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3601
28.9%
1475
11.8%
m 858
 
6.9%
- 784
 
6.3%
2 739
 
5.9%
1 683
 
5.5%
f 617
 
5.0%
t 617
 
5.0%
8 490
 
3.9%
4 490
 
3.9%
Other values (5) 2105
16.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8108
65.1%
Lowercase Letter 2092
 
16.8%
Space Separator 1475
 
11.8%
Dash Punctuation 784
 
6.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3601
44.4%
2 739
 
9.1%
1 683
 
8.4%
8 490
 
6.0%
4 490
 
6.0%
5 468
 
5.8%
3 458
 
5.6%
6 449
 
5.5%
9 375
 
4.6%
7 355
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
m 858
41.0%
f 617
29.5%
t 617
29.5%
Space Separator
ValueCountFrequency (%)
1475
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 784
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10367
83.2%
Latin 2092
 
16.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3601
34.7%
1475
14.2%
- 784
 
7.6%
2 739
 
7.1%
1 683
 
6.6%
8 490
 
4.7%
4 490
 
4.7%
5 468
 
4.5%
3 458
 
4.4%
6 449
 
4.3%
Other values (2) 730
 
7.0%
Latin
ValueCountFrequency (%)
m 858
41.0%
f 617
29.5%
t 617
29.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12459
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3601
28.9%
1475
11.8%
m 858
 
6.9%
- 784
 
6.3%
2 739
 
5.9%
1 683
 
5.5%
f 617
 
5.0%
t 617
 
5.0%
8 490
 
3.9%
4 490
 
3.9%
Other values (5) 2105
16.9%

decimalLatitude
Text

Missing 

Distinct2246
Distinct (%)16.9%
Missing5543
Missing (%)29.4%
Memory size147.5 KiB
2025-01-08T18:33:09.160494image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length9
Mean length7.516775501
Min length3

Characters and Unicode

Total characters100146
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1060 ?
Unique (%)8.0%

Sample

1st row41.358889
2nd row41.358889
3rd row40.280472
4th row39.966055
5th row40.280472
ValueCountFrequency (%)
44.049466 311
 
2.3%
44.059277 252
 
1.9%
44.062155 245
 
1.8%
3.9167 244
 
1.8%
44.05088 232
 
1.7%
44.061185 228
 
1.7%
44.041766 222
 
1.7%
44.059944 204
 
1.5%
41.3931 147
 
1.1%
41.3081 130
 
1.0%
Other values (2207) 11108
83.4%
2025-01-08T18:33:09.419511image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 15803
15.8%
. 13323
13.3%
3 11104
11.1%
1 8831
8.8%
6 8363
8.4%
5 7455
7.4%
0 7343
7.3%
7 6908
6.9%
2 6851
6.8%
8 6767
6.8%
Other values (2) 7398
7.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 85426
85.3%
Other Punctuation 13323
 
13.3%
Dash Punctuation 1397
 
1.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 15803
18.5%
3 11104
13.0%
1 8831
10.3%
6 8363
9.8%
5 7455
8.7%
0 7343
8.6%
7 6908
8.1%
2 6851
8.0%
8 6767
7.9%
9 6001
 
7.0%
Other Punctuation
ValueCountFrequency (%)
. 13323
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1397
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100146
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 15803
15.8%
. 13323
13.3%
3 11104
11.1%
1 8831
8.8%
6 8363
8.4%
5 7455
7.4%
0 7343
7.3%
7 6908
6.9%
2 6851
6.8%
8 6767
6.8%
Other values (2) 7398
7.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100146
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 15803
15.8%
. 13323
13.3%
3 11104
11.1%
1 8831
8.8%
6 8363
8.4%
5 7455
7.4%
0 7343
7.3%
7 6908
6.9%
2 6851
6.8%
8 6767
6.8%
Other values (2) 7398
7.4%

decimalLongitude
Text

Missing 

Distinct2284
Distinct (%)17.1%
Missing5543
Missing (%)29.4%
Memory size147.5 KiB
2025-01-08T18:33:09.622746image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.606995421
Min length3

Characters and Unicode

Total characters114671
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1091 ?
Unique (%)8.2%

Sample

1st row-72.903807
2nd row-72.903807
3rd row-75.050684
4th row-75.195683
5th row-75.050684
ValueCountFrequency (%)
71.27383 311
 
2.3%
71.304611 252
 
1.9%
71.297795 245
 
1.8%
136.1667 244
 
1.8%
71.307927 232
 
1.7%
71.303074 228
 
1.7%
71.319924 222
 
1.7%
71.308122 204
 
1.5%
71.290348 160
 
1.2%
72.8972 148
 
1.1%
Other values (2267) 11077
83.1%
2025-01-08T18:33:09.878174image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 14318
12.5%
7 13516
11.8%
. 13323
11.6%
3 11679
10.2%
- 11234
9.8%
2 8832
7.7%
6 7982
7.0%
9 7758
6.8%
0 7552
6.6%
8 7489
6.5%
Other values (2) 10988
9.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 90114
78.6%
Other Punctuation 13323
 
11.6%
Dash Punctuation 11234
 
9.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 14318
15.9%
7 13516
15.0%
3 11679
13.0%
2 8832
9.8%
6 7982
8.9%
9 7758
8.6%
0 7552
8.4%
8 7489
8.3%
5 5508
 
6.1%
4 5480
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 13323
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11234
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 114671
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 14318
12.5%
7 13516
11.8%
. 13323
11.6%
3 11679
10.2%
- 11234
9.8%
2 8832
7.7%
6 7982
7.0%
9 7758
6.8%
0 7552
6.6%
8 7489
6.5%
Other values (2) 10988
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 114671
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 14318
12.5%
7 13516
11.8%
. 13323
11.6%
3 11679
10.2%
- 11234
9.8%
2 8832
7.7%
6 7982
7.0%
9 7758
6.8%
0 7552
6.6%
8 7489
6.5%
Other values (2) 10988
9.6%
Distinct475
Distinct (%)3.6%
Missing5609
Missing (%)29.7%
Memory size147.5 KiB
2025-01-08T18:33:10.035098image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length9
Median length6
Mean length6.103341631
Min length4

Characters and Unicode

Total characters80912
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)1.7%

Sample

1st row5359.0
2nd row5359.0
3rd row5359.0
4th row5359.0
5th row5359.0
ValueCountFrequency (%)
1850.0 5476
41.3%
1851.0 4930
37.2%
111111.0 329
 
2.5%
3036.0 110
 
0.8%
1583.0 104
 
0.8%
301.0 97
 
0.7%
103733.0 86
 
0.6%
5000.0 84
 
0.6%
300.0 79
 
0.6%
500.0 66
 
0.5%
Other values (465) 1896
 
14.3%
2025-01-08T18:33:10.243381image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 20535
25.4%
1 19011
23.5%
. 13257
16.4%
5 11398
14.1%
8 11362
14.0%
3 1449
 
1.8%
4 978
 
1.2%
7 822
 
1.0%
6 746
 
0.9%
2 681
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 67655
83.6%
Other Punctuation 13257
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 20535
30.4%
1 19011
28.1%
5 11398
16.8%
8 11362
16.8%
3 1449
 
2.1%
4 978
 
1.4%
7 822
 
1.2%
6 746
 
1.1%
2 681
 
1.0%
9 673
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 13257
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 80912
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 20535
25.4%
1 19011
23.5%
. 13257
16.4%
5 11398
14.1%
8 11362
14.0%
3 1449
 
1.8%
4 978
 
1.2%
7 822
 
1.0%
6 746
 
0.9%
2 681
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80912
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 20535
25.4%
1 19011
23.5%
. 13257
16.4%
5 11398
14.1%
8 11362
14.0%
3 1449
 
1.8%
4 978
 
1.2%
7 822
 
1.0%
6 746
 
0.9%
2 681
 
0.8%

georeferencedBy
Text

Missing 

Distinct14
Distinct (%)4.3%
Missing18537
Missing (%)98.3%
Memory size147.5 KiB
2025-01-08T18:33:10.320020image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length26
Median length17
Mean length17.73860182
Min length13

Characters and Unicode

Total characters5836
Distinct characters42
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)1.5%

Sample

1st rowPiper L. Stepule
2nd rowPiper L. Stepule
3rd rowPeter A. Capainolo
4th rowKristof Zyskowski
5th rowNicholas J. Kerhoulas
ValueCountFrequency (%)
kristof 233
31.6%
zyskowski 233
31.6%
j 37
 
5.0%
gregory 24
 
3.3%
watkins-colwell 24
 
3.3%
peter 22
 
3.0%
a 22
 
3.0%
capainolo 22
 
3.0%
dornburg 14
 
1.9%
alex 14
 
1.9%
Other values (26) 93
 
12.6%
2025-01-08T18:33:10.448504image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 761
13.0%
o 607
 
10.4%
i 545
 
9.3%
k 497
 
8.5%
409
 
7.0%
r 364
 
6.2%
t 294
 
5.0%
y 269
 
4.6%
w 263
 
4.5%
K 251
 
4.3%
Other values (32) 1576
27.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4560
78.1%
Uppercase Letter 763
 
13.1%
Space Separator 409
 
7.0%
Other Punctuation 80
 
1.4%
Dash Punctuation 24
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 761
16.7%
o 607
13.3%
i 545
12.0%
k 497
10.9%
r 364
8.0%
t 294
 
6.4%
y 269
 
5.9%
w 263
 
5.8%
f 234
 
5.1%
e 163
 
3.6%
Other values (13) 563
12.3%
Uppercase Letter
ValueCountFrequency (%)
K 251
32.9%
Z 233
30.5%
C 49
 
6.4%
J 43
 
5.6%
A 39
 
5.1%
P 33
 
4.3%
W 30
 
3.9%
G 24
 
3.1%
D 17
 
2.2%
S 13
 
1.7%
Other values (6) 31
 
4.1%
Space Separator
ValueCountFrequency (%)
409
100.0%
Other Punctuation
ValueCountFrequency (%)
. 80
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5323
91.2%
Common 513
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 761
14.3%
o 607
11.4%
i 545
10.2%
k 497
9.3%
r 364
 
6.8%
t 294
 
5.5%
y 269
 
5.1%
w 263
 
4.9%
K 251
 
4.7%
f 234
 
4.4%
Other values (29) 1238
23.3%
Common
ValueCountFrequency (%)
409
79.7%
. 80
 
15.6%
- 24
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5836
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 761
13.0%
o 607
 
10.4%
i 545
 
9.3%
k 497
 
8.5%
409
 
7.0%
r 364
 
6.2%
t 294
 
5.0%
y 269
 
4.6%
w 263
 
4.5%
K 251
 
4.3%
Other values (32) 1576
27.0%

georeferencedDate
Text

Missing 

Distinct48
Distinct (%)0.6%
Missing10549
Missing (%)55.9%
Memory size147.5 KiB
2025-01-08T18:33:10.512504image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.131417578
Min length4

Characters and Unicode

Total characters75946
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015
ValueCountFrequency (%)
2023-12-28 5807
69.8%
2015 1204
 
14.5%
2020-06-14 935
 
11.2%
2020-12-30 124
 
1.5%
2023-12-03 45
 
0.5%
2021-12-08 27
 
0.3%
2024-01-17 18
 
0.2%
2024-05-01 17
 
0.2%
2019-11-04 16
 
0.2%
2022-06-18 14
 
0.2%
Other values (38) 110
 
1.3%
2025-01-08T18:33:10.629179image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 61720
81.3%
Dash Punctuation 14226
 
18.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 27323
44.3%
0 10723
 
17.4%
1 8422
 
13.6%
3 6079
 
9.8%
8 5869
 
9.5%
5 1236
 
2.0%
4 1028
 
1.7%
6 994
 
1.6%
7 24
 
< 0.1%
9 22
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 14226
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 75946
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75946
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

georeferenceProtocol
Text

Missing 

Distinct3
Distinct (%)< 0.1%
Missing5610
Missing (%)29.7%
Memory size147.5 KiB
2025-01-08T18:33:10.677275image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length16
Mean length13.75980688
Min length11

Characters and Unicode

Total characters182400
Distinct characters18
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowdigital resource
2nd rowdigital resource
3rd rowdigital resource
4th rowdigital resource
5th rowdigital resource
ValueCountFrequency (%)
resource 7300
35.5%
digital 7216
35.1%
unspecified 5956
29.0%
physical 84
 
0.4%
2025-01-08T18:33:10.786068image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 26512
14.5%
i 26428
14.5%
r 14600
 
8.0%
s 13340
 
7.3%
c 13340
 
7.3%
u 13256
 
7.3%
d 13172
 
7.2%
7300
 
4.0%
l 7300
 
4.0%
a 7300
 
4.0%
Other values (8) 39852
21.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 175100
96.0%
Space Separator 7300
 
4.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 26512
15.1%
i 26428
15.1%
r 14600
8.3%
s 13340
7.6%
c 13340
7.6%
u 13256
7.6%
d 13172
7.5%
l 7300
 
4.2%
a 7300
 
4.2%
o 7300
 
4.2%
Other values (7) 32552
18.6%
Space Separator
ValueCountFrequency (%)
7300
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 175100
96.0%
Common 7300
 
4.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 26512
15.1%
i 26428
15.1%
r 14600
8.3%
s 13340
7.6%
c 13340
7.6%
u 13256
7.6%
d 13172
7.5%
l 7300
 
4.2%
a 7300
 
4.2%
o 7300
 
4.2%
Other values (7) 32552
18.6%
Common
ValueCountFrequency (%)
7300
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 182400
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 26512
14.5%
i 26428
14.5%
r 14600
 
8.0%
s 13340
 
7.3%
c 13340
 
7.3%
u 13256
 
7.3%
d 13172
 
7.2%
7300
 
4.0%
l 7300
 
4.0%
a 7300
 
4.0%
Other values (8) 39852
21.8%

georeferenceSources
Text

Missing 

Distinct14
Distinct (%)0.1%
Missing5615
Missing (%)29.8%
Memory size147.5 KiB
2025-01-08T18:33:10.844910image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length21
Median length15
Mean length9.898347295
Min length4

Characters and Unicode

Total characters131163
Distinct characters42
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNEVP
2nd rowNEVP
3rd rowNEVP
4th rowNEVP
5th rowNEVP
ValueCountFrequency (%)
unspecified 5957
31.8%
unit 3838
20.5%
gps 3838
20.5%
geolocate 1254
 
6.7%
google 785
 
4.2%
earth 713
 
3.8%
vertnet 649
 
3.5%
2014 290
 
1.5%
census 290
 
1.5%
tiger 290
 
1.5%
Other values (11) 847
 
4.5%
2025-01-08T18:33:10.973093image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 16291
12.4%
e 15708
12.0%
n 10145
 
7.7%
u 10138
 
7.7%
c 7413
 
5.7%
t 7162
 
5.5%
s 6614
 
5.0%
p 6243
 
4.8%
G 6167
 
4.7%
d 6099
 
4.6%
Other values (32) 39183
29.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 101255
77.2%
Uppercase Letter 23104
 
17.6%
Space Separator 5500
 
4.2%
Decimal Number 1160
 
0.9%
Other Punctuation 144
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 16291
16.1%
e 15708
15.5%
n 10145
10.0%
u 10138
10.0%
c 7413
7.3%
t 7162
7.1%
s 6614
6.5%
p 6243
 
6.2%
d 6099
 
6.0%
f 5957
 
5.9%
Other values (10) 9485
9.4%
Uppercase Letter
ValueCountFrequency (%)
G 6167
26.7%
S 4128
17.9%
P 4106
17.8%
E 2531
11.0%
L 1254
 
5.4%
O 1254
 
5.4%
N 923
 
4.0%
V 917
 
4.0%
T 296
 
1.3%
C 290
 
1.3%
Other values (6) 1238
 
5.4%
Decimal Number
ValueCountFrequency (%)
4 290
25.0%
1 290
25.0%
0 290
25.0%
2 290
25.0%
Space Separator
ValueCountFrequency (%)
5500
100.0%
Other Punctuation
ValueCountFrequency (%)
. 144
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 124359
94.8%
Common 6804
 
5.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 16291
13.1%
e 15708
12.6%
n 10145
 
8.2%
u 10138
 
8.2%
c 7413
 
6.0%
t 7162
 
5.8%
s 6614
 
5.3%
p 6243
 
5.0%
G 6167
 
5.0%
d 6099
 
4.9%
Other values (26) 32379
26.0%
Common
ValueCountFrequency (%)
5500
80.8%
4 290
 
4.3%
1 290
 
4.3%
0 290
 
4.3%
2 290
 
4.3%
. 144
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 131163
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 16291
12.4%
e 15708
12.0%
n 10145
 
7.7%
u 10138
 
7.7%
c 7413
 
5.7%
t 7162
 
5.5%
s 6614
 
5.0%
p 6243
 
4.8%
G 6167
 
4.7%
d 6099
 
4.6%
Other values (32) 39183
29.9%

georeferenceRemarks
Text

Missing 

Distinct562
Distinct (%)4.3%
Missing5661
Missing (%)30.0%
Memory size147.5 KiB
2025-01-08T18:33:11.150419image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length570
Median length446
Mean length102.1251799
Min length8

Characters and Unicode

Total characters1348563
Distinct characters84
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique291 ?
Unique (%)2.2%

Sample

1st rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
2nd rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
3rd rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
4th rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
5th rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
ValueCountFrequency (%)
for 11797
 
5.4%
km 11604
 
5.3%
radius 10782
 
5.0%
georeference 7659
 
3.5%
to 6875
 
3.2%
by 5881
 
2.7%
was 5876
 
2.7%
that 5847
 
2.7%
only 5832
 
2.7%
ex 5813
 
2.7%
Other values (1631) 139388
64.1%
2025-01-08T18:33:11.403517image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
204184
15.1%
e 140668
 
10.4%
r 102754
 
7.6%
i 71247
 
5.3%
o 67231
 
5.0%
s 56371
 
4.2%
a 55673
 
4.1%
n 55507
 
4.1%
t 49761
 
3.7%
d 47386
 
3.5%
Other values (74) 497781
36.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 936316
69.4%
Space Separator 204186
 
15.1%
Decimal Number 112539
 
8.3%
Uppercase Letter 62764
 
4.7%
Other Punctuation 22473
 
1.7%
Dash Punctuation 9955
 
0.7%
Open Punctuation 131
 
< 0.1%
Close Punctuation 131
 
< 0.1%
Math Symbol 66
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 140668
15.0%
r 102754
11.0%
i 71247
 
7.6%
o 67231
 
7.2%
s 56371
 
6.0%
a 55673
 
5.9%
n 55507
 
5.9%
t 49761
 
5.3%
d 47386
 
5.1%
c 38921
 
4.2%
Other values (16) 250797
26.8%
Uppercase Letter
ValueCountFrequency (%)
S 12725
20.3%
F 7867
12.5%
M 7849
12.5%
A 7349
11.7%
D 5962
9.5%
G 2949
 
4.7%
C 2406
 
3.8%
L 2224
 
3.5%
O 2065
 
3.3%
N 1923
 
3.1%
Other values (16) 9445
15.0%
Decimal Number
ValueCountFrequency (%)
1 37695
33.5%
0 27546
24.5%
2 17351
15.4%
9 12223
 
10.9%
4 11178
 
9.9%
5 2315
 
2.1%
6 1714
 
1.5%
8 1016
 
0.9%
3 924
 
0.8%
7 577
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 8935
39.8%
. 6543
29.1%
: 3957
17.6%
/ 2222
 
9.9%
; 391
 
1.7%
' 293
 
1.3%
" 60
 
0.3%
& 48
 
0.2%
? 16
 
0.1%
% 8
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 43
65.2%
+ 20
30.3%
~ 3
 
4.5%
Space Separator
ValueCountFrequency (%)
204184
> 99.9%
  2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 118
90.1%
[ 13
 
9.9%
Close Punctuation
ValueCountFrequency (%)
) 118
90.1%
] 13
 
9.9%
Dash Punctuation
ValueCountFrequency (%)
- 9955
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 999080
74.1%
Common 349483
 
25.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 140668
14.1%
r 102754
 
10.3%
i 71247
 
7.1%
o 67231
 
6.7%
s 56371
 
5.6%
a 55673
 
5.6%
n 55507
 
5.6%
t 49761
 
5.0%
d 47386
 
4.7%
c 38921
 
3.9%
Other values (42) 313561
31.4%
Common
ValueCountFrequency (%)
204184
58.4%
1 37695
 
10.8%
0 27546
 
7.9%
2 17351
 
5.0%
9 12223
 
3.5%
4 11178
 
3.2%
- 9955
 
2.8%
, 8935
 
2.6%
. 6543
 
1.9%
: 3957
 
1.1%
Other values (22) 9916
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1348560
> 99.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
204184
15.1%
e 140668
 
10.4%
r 102754
 
7.6%
i 71247
 
5.3%
o 67231
 
5.0%
s 56371
 
4.2%
a 55673
 
4.1%
n 55507
 
4.1%
t 49761
 
3.7%
d 47386
 
3.5%
Other values (72) 497778
36.9%
None
ValueCountFrequency (%)
  2
66.7%
¤ 1
33.3%

typeStatus
Text

Missing 

Distinct5
Distinct (%)22.7%
Missing18844
Missing (%)99.9%
Memory size147.5 KiB
2025-01-08T18:33:11.456404image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.090909091
Min length8

Characters and Unicode

Total characters178
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)9.1%

Sample

1st rowHYPOTYPE
2nd rowPARATYPE
3rd rowHYPOTYPE
4th rowHYPOTYPE
5th rowHYPOTYPE
ValueCountFrequency (%)
hypotype 13
59.1%
paratype 5
 
22.7%
topotype 2
 
9.1%
plesiotype 1
 
4.5%
holotype 1
 
4.5%
2025-01-08T18:33:11.562515image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
P 43
24.2%
Y 35
19.7%
T 24
13.5%
E 23
12.9%
O 20
11.2%
H 14
 
7.9%
A 10
 
5.6%
R 5
 
2.8%
L 2
 
1.1%
S 1
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 178
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P 43
24.2%
Y 35
19.7%
T 24
13.5%
E 23
12.9%
O 20
11.2%
H 14
 
7.9%
A 10
 
5.6%
R 5
 
2.8%
L 2
 
1.1%
S 1
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 178
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
P 43
24.2%
Y 35
19.7%
T 24
13.5%
E 23
12.9%
O 20
11.2%
H 14
 
7.9%
A 10
 
5.6%
R 5
 
2.8%
L 2
 
1.1%
S 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 178
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
P 43
24.2%
Y 35
19.7%
T 24
13.5%
E 23
12.9%
O 20
11.2%
H 14
 
7.9%
A 10
 
5.6%
R 5
 
2.8%
L 2
 
1.1%
S 1
 
0.6%

identifiedBy
Text

Missing 

Distinct46
Distinct (%)4.1%
Missing17735
Missing (%)94.0%
Memory size147.5 KiB
2025-01-08T18:33:11.656662image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length26
Median length21
Mean length15.7020336
Min length6

Characters and Unicode

Total characters17759
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.9%

Sample

1st rowGary P. Aronsen
2nd rowGary P. Aronsen
3rd rowJosé A. Ottenwalder
4th rowAngus J. Mossman
5th rowAngus J. Mossman
ValueCountFrequency (%)
jordan 278
 
8.9%
colosi 278
 
8.9%
g 278
 
8.9%
a 247
 
7.9%
mary 240
 
7.7%
turner 240
 
7.7%
kristof 101
 
3.2%
zyskowski 101
 
3.2%
alex 100
 
3.2%
dornburg 100
 
3.2%
Other values (91) 1159
37.1%
2025-01-08T18:33:11.822097image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1991
 
11.2%
r 1773
 
10.0%
o 1434
 
8.1%
n 1105
 
6.2%
a 1041
 
5.9%
e 976
 
5.5%
s 880
 
5.0%
i 864
 
4.9%
. 854
 
4.8%
l 730
 
4.1%
Other values (40) 6111
34.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 11749
66.2%
Uppercase Letter 3145
 
17.7%
Space Separator 1991
 
11.2%
Other Punctuation 854
 
4.8%
Dash Punctuation 20
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 1773
15.1%
o 1434
12.2%
n 1105
9.4%
a 1041
8.9%
e 976
8.3%
s 880
7.5%
i 864
7.4%
l 730
 
6.2%
d 462
 
3.9%
y 415
 
3.5%
Other values (15) 2069
17.6%
Uppercase Letter
ValueCountFrequency (%)
A 467
14.8%
J 384
12.2%
C 371
11.8%
M 341
10.8%
G 309
9.8%
K 263
8.4%
T 244
7.8%
D 107
 
3.4%
N 103
 
3.3%
Z 101
 
3.2%
Other values (12) 455
14.5%
Space Separator
ValueCountFrequency (%)
1991
100.0%
Other Punctuation
ValueCountFrequency (%)
. 854
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14894
83.9%
Common 2865
 
16.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 1773
 
11.9%
o 1434
 
9.6%
n 1105
 
7.4%
a 1041
 
7.0%
e 976
 
6.6%
s 880
 
5.9%
i 864
 
5.8%
l 730
 
4.9%
A 467
 
3.1%
d 462
 
3.1%
Other values (37) 5162
34.7%
Common
ValueCountFrequency (%)
1991
69.5%
. 854
29.8%
- 20
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17758
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1991
 
11.2%
r 1773
 
10.0%
o 1434
 
8.1%
n 1105
 
6.2%
a 1041
 
5.9%
e 976
 
5.5%
s 880
 
5.0%
i 864
 
4.9%
. 854
 
4.8%
l 730
 
4.1%
Other values (39) 6110
34.4%
None
ValueCountFrequency (%)
é 1
100.0%

dateIdentified
Text

Missing 

Distinct26
Distinct (%)2.7%
Missing17913
Missing (%)94.9%
Memory size147.5 KiB
2025-01-08T18:33:11.892287image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters18107
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st row2016-01-01T00:00:00
2nd row2016-01-01T00:00:00
3rd row1985-01-01T00:00:00
4th row2016-01-01T00:00:00
5th row2016-01-01T00:00:00
ValueCountFrequency (%)
2008-01-01t00:00:00 271
28.4%
2009-01-01t00:00:00 257
27.0%
2007-01-01t00:00:00 130
13.6%
2012-01-01t00:00:00 126
13.2%
2016-01-01t00:00:00 26
 
2.7%
2011-01-01t00:00:00 22
 
2.3%
2020-01-01t00:00:00 22
 
2.3%
2010-01-01t00:00:00 22
 
2.3%
2024-01-01t00:00:00 18
 
1.9%
2023-01-01t00:00:00 15
 
1.6%
Other values (16) 44
 
4.6%
2025-01-08T18:33:12.009054image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9280
51.3%
1 2156
 
11.9%
- 1906
 
10.5%
: 1906
 
10.5%
2 1137
 
6.3%
T 953
 
5.3%
8 276
 
1.5%
9 274
 
1.5%
7 132
 
0.7%
6 33
 
0.2%
Other values (3) 54
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13342
73.7%
Dash Punctuation 1906
 
10.5%
Other Punctuation 1906
 
10.5%
Uppercase Letter 953
 
5.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9280
69.6%
1 2156
 
16.2%
2 1137
 
8.5%
8 276
 
2.1%
9 274
 
2.1%
7 132
 
1.0%
6 33
 
0.2%
4 25
 
0.2%
3 21
 
0.2%
5 8
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1906
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1906
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 953
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17154
94.7%
Latin 953
 
5.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9280
54.1%
1 2156
 
12.6%
- 1906
 
11.1%
: 1906
 
11.1%
2 1137
 
6.6%
8 276
 
1.6%
9 274
 
1.6%
7 132
 
0.8%
6 33
 
0.2%
4 25
 
0.1%
Other values (2) 29
 
0.2%
Latin
ValueCountFrequency (%)
T 953
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18107
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9280
51.3%
1 2156
 
11.9%
- 1906
 
10.5%
: 1906
 
10.5%
2 1137
 
6.3%
T 953
 
5.3%
8 276
 
1.5%
9 274
 
1.5%
7 132
 
0.7%
6 33
 
0.2%
Other values (3) 54
 
0.3%

identificationRemarks
Text

Missing 

Distinct3
Distinct (%)100.0%
Missing18863
Missing (%)> 99.9%
Memory size147.5 KiB
2025-01-08T18:33:12.075716image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length57
Median length6
Mean length22.66666667
Min length5

Characters and Unicode

Total characters68
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st rowreferenced on page 89 in the descripton of Agouti thomasi
2nd rowEaton
3rd rowThorpe
ValueCountFrequency (%)
referenced 1
8.3%
on 1
8.3%
page 1
8.3%
89 1
8.3%
in 1
8.3%
the 1
8.3%
descripton 1
8.3%
of 1
8.3%
agouti 1
8.3%
thomasi 1
8.3%
Other values (2) 2
16.7%
2025-01-08T18:33:12.185389image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
13.2%
e 8
11.8%
o 7
10.3%
n 5
 
7.4%
t 5
 
7.4%
r 4
 
5.9%
i 4
 
5.9%
h 3
 
4.4%
a 3
 
4.4%
p 3
 
4.4%
Other values (12) 17
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 54
79.4%
Space Separator 9
 
13.2%
Uppercase Letter 3
 
4.4%
Decimal Number 2
 
2.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 8
14.8%
o 7
13.0%
n 5
9.3%
t 5
9.3%
r 4
 
7.4%
i 4
 
7.4%
h 3
 
5.6%
a 3
 
5.6%
p 3
 
5.6%
g 2
 
3.7%
Other values (6) 10
18.5%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
A 1
33.3%
T 1
33.3%
Decimal Number
ValueCountFrequency (%)
8 1
50.0%
9 1
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 57
83.8%
Common 11
 
16.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 8
14.0%
o 7
12.3%
n 5
 
8.8%
t 5
 
8.8%
r 4
 
7.0%
i 4
 
7.0%
h 3
 
5.3%
a 3
 
5.3%
p 3
 
5.3%
g 2
 
3.5%
Other values (9) 13
22.8%
Common
ValueCountFrequency (%)
9
81.8%
8 1
 
9.1%
9 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9
13.2%
e 8
11.8%
o 7
10.3%
n 5
 
7.4%
t 5
 
7.4%
r 4
 
5.9%
i 4
 
5.9%
h 3
 
4.4%
a 3
 
4.4%
p 3
 
4.4%
Other values (12) 17
25.0%
Distinct1854
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:12.373465image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length110
Median length59
Mean length32.02750981
Min length6

Characters and Unicode

Total characters604231
Distinct characters75
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique618 ?
Unique (%)3.3%

Sample

1st rowTamias striatus fisheri A.H.Howell, 1925
2nd rowPeromyscus leucopus (Rafinesque, 1818)
3rd rowPeromyscus leucopus (Rafinesque, 1818)
4th rowPeromyscus leucopus (Rafinesque, 1818)
5th rowPeromyscus leucopus (Rafinesque, 1818)
ValueCountFrequency (%)
linnaeus 2062
 
2.9%
peromyscus 1837
 
2.5%
1758 1574
 
2.2%
1830 1569
 
2.2%
cinereus 1490
 
2.1%
sorex 1193
 
1.7%
brevicauda 1124
 
1.6%
blarina 976
 
1.4%
zibethicus 898
 
1.2%
talpoides 867
 
1.2%
Other values (2496) 58473
81.1%
2025-01-08T18:33:12.635867image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53197
 
8.8%
s 44621
 
7.4%
a 41941
 
6.9%
e 40071
 
6.6%
i 39594
 
6.6%
u 33817
 
5.6%
r 32580
 
5.4%
n 27699
 
4.6%
o 26491
 
4.4%
l 20698
 
3.4%
Other values (65) 243522
40.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 424685
70.3%
Decimal Number 56520
 
9.4%
Space Separator 53197
 
8.8%
Uppercase Letter 35563
 
5.9%
Other Punctuation 16413
 
2.7%
Open Punctuation 8827
 
1.5%
Close Punctuation 8827
 
1.5%
Dash Punctuation 199
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 44621
10.5%
a 41941
9.9%
e 40071
9.4%
i 39594
9.3%
u 33817
 
8.0%
r 32580
 
7.7%
n 27699
 
6.5%
o 26491
 
6.2%
l 20698
 
4.9%
c 20108
 
4.7%
Other values (20) 97065
22.9%
Uppercase Letter
ValueCountFrequency (%)
P 3596
 
10.1%
L 3097
 
8.7%
S 3064
 
8.6%
M 3059
 
8.6%
C 2876
 
8.1%
G 2711
 
7.6%
B 2540
 
7.1%
R 1786
 
5.0%
T 1753
 
4.9%
O 1690
 
4.8%
Other values (17) 9391
26.4%
Decimal Number
ValueCountFrequency (%)
1 17134
30.3%
8 12257
21.7%
7 5671
 
10.0%
9 4373
 
7.7%
5 3669
 
6.5%
0 3595
 
6.4%
3 3256
 
5.8%
6 2466
 
4.4%
2 2350
 
4.2%
4 1749
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 14230
86.7%
. 1751
 
10.7%
& 427
 
2.6%
' 5
 
< 0.1%
Space Separator
ValueCountFrequency (%)
53197
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8827
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8827
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 199
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 460248
76.2%
Common 143983
 
23.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 44621
 
9.7%
a 41941
 
9.1%
e 40071
 
8.7%
i 39594
 
8.6%
u 33817
 
7.3%
r 32580
 
7.1%
n 27699
 
6.0%
o 26491
 
5.8%
l 20698
 
4.5%
c 20108
 
4.4%
Other values (47) 132628
28.8%
Common
ValueCountFrequency (%)
53197
36.9%
1 17134
 
11.9%
, 14230
 
9.9%
8 12257
 
8.5%
( 8827
 
6.1%
) 8827
 
6.1%
7 5671
 
3.9%
9 4373
 
3.0%
5 3669
 
2.5%
0 3595
 
2.5%
Other values (8) 12203
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 603778
99.9%
None 453
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53197
 
8.8%
s 44621
 
7.4%
a 41941
 
6.9%
e 40071
 
6.6%
i 39594
 
6.6%
u 33817
 
5.6%
r 32580
 
5.4%
n 27699
 
4.6%
o 26491
 
4.4%
l 20698
 
3.4%
Other values (60) 243069
40.3%
None
ValueCountFrequency (%)
É 250
55.2%
ü 136
30.0%
è 28
 
6.2%
é 25
 
5.5%
ö 14
 
3.1%
Distinct256
Distinct (%)1.4%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:12.792119image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length231
Median length222
Mean length176.5778336
Min length30

Characters and Unicode

Total characters3304301
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)0.1%

Sample

1st rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Sciuromorpha; Sciurida; Sciuridae; Xerinae
2nd rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
3rd rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
4th rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
5th rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
ValueCountFrequency (%)
animalia 18713
 
8.8%
vertebrata 18713
 
8.8%
chordata 18713
 
8.8%
amniota 18711
 
8.8%
mammalia 18711
 
8.8%
theriiformes-----theria-placentalia-epitheria 15223
 
7.1%
rodentia 8426
 
3.9%
preptotheria-anagalida-simplicidentata 8425
 
3.9%
myomorpha 5919
 
2.8%
myodonta 5717
 
2.7%
Other values (374) 76277
35.7%
2025-01-08T18:33:13.020926image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 452709
13.7%
i 335054
 
10.1%
e 250563
 
7.6%
r 228161
 
6.9%
t 207108
 
6.3%
; 194835
 
5.9%
194835
 
5.9%
o 167342
 
5.1%
- 154910
 
4.7%
n 124331
 
3.8%
Other values (40) 994453
30.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2464975
74.6%
Uppercase Letter 294746
 
8.9%
Other Punctuation 194835
 
5.9%
Space Separator 194835
 
5.9%
Dash Punctuation 154910
 
4.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 452709
18.4%
i 335054
13.6%
e 250563
10.2%
r 228161
9.3%
t 207108
8.4%
o 167342
 
6.8%
n 124331
 
5.0%
m 121823
 
4.9%
l 112560
 
4.6%
h 109691
 
4.4%
Other values (14) 355633
14.4%
Uppercase Letter
ValueCountFrequency (%)
A 54733
18.6%
M 41057
13.9%
T 37832
12.8%
P 36866
12.5%
C 34091
11.6%
E 20860
 
7.1%
S 20718
 
7.0%
V 19658
 
6.7%
R 9908
 
3.4%
F 3662
 
1.2%
Other values (13) 15361
 
5.2%
Other Punctuation
ValueCountFrequency (%)
; 194835
100.0%
Space Separator
ValueCountFrequency (%)
194835
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 154910
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2759721
83.5%
Common 544580
 
16.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 452709
16.4%
i 335054
12.1%
e 250563
 
9.1%
r 228161
 
8.3%
t 207108
 
7.5%
o 167342
 
6.1%
n 124331
 
4.5%
m 121823
 
4.4%
l 112560
 
4.1%
h 109691
 
4.0%
Other values (37) 650379
23.6%
Common
ValueCountFrequency (%)
; 194835
35.8%
194835
35.8%
- 154910
28.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3304301
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 452709
13.7%
i 335054
 
10.1%
e 250563
 
7.6%
r 228161
 
6.9%
t 207108
 
6.3%
; 194835
 
5.9%
194835
 
5.9%
o 167342
 
5.1%
- 154910
 
4.7%
n 124331
 
3.8%
Other values (40) 994453
30.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:13.071391image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length8
Mean length8.048340931
Min length8

Characters and Unicode

Total characters151840
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimalia
2nd rowAnimalia
3rd rowAnimalia
4th rowAnimalia
5th rowAnimalia
ValueCountFrequency (%)
animalia 18714
98.4%
incertae 152
 
0.8%
sedis 152
 
0.8%
2025-01-08T18:33:13.175293image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 37732
24.8%
a 37580
24.7%
n 18866
12.4%
A 18714
12.3%
m 18714
12.3%
l 18714
12.3%
e 456
 
0.3%
s 304
 
0.2%
c 152
 
0.1%
r 152
 
0.1%
Other values (3) 456
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 132974
87.6%
Uppercase Letter 18714
 
12.3%
Space Separator 152
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 37732
28.4%
a 37580
28.3%
n 18866
14.2%
m 18714
14.1%
l 18714
14.1%
e 456
 
0.3%
s 304
 
0.2%
c 152
 
0.1%
r 152
 
0.1%
t 152
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
A 18714
100.0%
Space Separator
ValueCountFrequency (%)
152
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 151688
99.9%
Common 152
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 37732
24.9%
a 37580
24.8%
n 18866
12.4%
A 18714
12.3%
m 18714
12.3%
l 18714
12.3%
e 456
 
0.3%
s 304
 
0.2%
c 152
 
0.1%
r 152
 
0.1%
Other values (2) 304
 
0.2%
Common
ValueCountFrequency (%)
152
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 151840
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 37732
24.8%
a 37580
24.7%
n 18866
12.4%
A 18714
12.3%
m 18714
12.3%
l 18714
12.3%
e 456
 
0.3%
s 304
 
0.2%
c 152
 
0.1%
r 152
 
0.1%
Other values (3) 456
 
0.3%

phylum
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:13.216005image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters149712
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowChordata
2nd rowChordata
3rd rowChordata
4th rowChordata
5th rowChordata
ValueCountFrequency (%)
chordata 18714
100.0%
2025-01-08T18:33:13.309381image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 37428
25.0%
C 18714
12.5%
h 18714
12.5%
o 18714
12.5%
r 18714
12.5%
d 18714
12.5%
t 18714
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130998
87.5%
Uppercase Letter 18714
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 37428
28.6%
h 18714
14.3%
o 18714
14.3%
r 18714
14.3%
d 18714
14.3%
t 18714
14.3%
Uppercase Letter
ValueCountFrequency (%)
C 18714
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149712
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 37428
25.0%
C 18714
12.5%
h 18714
12.5%
o 18714
12.5%
r 18714
12.5%
d 18714
12.5%
t 18714
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149712
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 37428
25.0%
C 18714
12.5%
h 18714
12.5%
o 18714
12.5%
r 18714
12.5%
d 18714
12.5%
t 18714
12.5%

class
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing154
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:13.349331image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters149696
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMammalia
2nd rowMammalia
3rd rowMammalia
4th rowMammalia
5th rowMammalia
ValueCountFrequency (%)
mammalia 18712
100.0%
2025-01-08T18:33:13.442522image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 56136
37.5%
m 37424
25.0%
M 18712
 
12.5%
l 18712
 
12.5%
i 18712
 
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130984
87.5%
Uppercase Letter 18712
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 56136
42.9%
m 37424
28.6%
l 18712
 
14.3%
i 18712
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
M 18712
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149696
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 56136
37.5%
m 37424
25.0%
M 18712
 
12.5%
l 18712
 
12.5%
i 18712
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149696
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 56136
37.5%
m 37424
25.0%
M 18712
 
12.5%
l 18712
 
12.5%
i 18712
 
12.5%

order
Text

Missing 

Distinct27
Distinct (%)0.1%
Missing406
Missing (%)2.2%
Memory size147.5 KiB
2025-01-08T18:33:13.501572image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length8
Mean length9.43624052
Min length6

Characters and Unicode

Total characters174193
Distinct characters32
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRodentia
2nd rowRodentia
3rd rowRodentia
4th rowRodentia
5th rowRodentia
ValueCountFrequency (%)
rodentia 8426
45.6%
soricomorpha 2476
 
13.4%
carnivora 2371
 
12.8%
artiodactyla 1529
 
8.3%
chiroptera 1102
 
6.0%
primates 953
 
5.2%
lagomorpha 348
 
1.9%
diprotodontia 248
 
1.3%
didelphimorphia 213
 
1.2%
perissodactyla 157
 
0.9%
Other values (17) 637
 
3.5%
2025-01-08T18:33:13.622083image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 23323
13.4%
o 23289
13.4%
i 18719
10.7%
r 15878
9.1%
t 14533
8.3%
e 11474
 
6.6%
n 11293
 
6.5%
d 10796
 
6.2%
R 8426
 
4.8%
p 4723
 
2.7%
Other values (22) 31739
18.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 155733
89.4%
Uppercase Letter 18460
 
10.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 23323
15.0%
o 23289
15.0%
i 18719
12.0%
r 15878
10.2%
t 14533
9.3%
e 11474
7.4%
n 11293
7.3%
d 10796
6.9%
p 4723
 
3.0%
c 4545
 
2.9%
Other values (10) 17160
11.0%
Uppercase Letter
ValueCountFrequency (%)
R 8426
45.6%
C 3679
19.9%
S 2507
 
13.6%
A 1588
 
8.6%
P 1254
 
6.8%
D 495
 
2.7%
L 348
 
1.9%
M 87
 
0.5%
E 41
 
0.2%
H 29
 
0.2%
Other values (2) 6
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 174193
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 23323
13.4%
o 23289
13.4%
i 18719
10.7%
r 15878
9.1%
t 14533
8.3%
e 11474
 
6.6%
n 11293
 
6.5%
d 10796
 
6.2%
R 8426
 
4.8%
p 4723
 
2.7%
Other values (22) 31739
18.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 174193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 23323
13.4%
o 23289
13.4%
i 18719
10.7%
r 15878
9.1%
t 14533
8.3%
e 11474
 
6.6%
n 11293
 
6.5%
d 10796
 
6.2%
R 8426
 
4.8%
p 4723
 
2.7%
Other values (22) 31739
18.2%

family
Text

Missing 

Distinct134
Distinct (%)0.7%
Missing684
Missing (%)3.6%
Memory size147.5 KiB
2025-01-08T18:33:13.757840image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length16
Mean length9.657463425
Min length6

Characters and Unicode

Total characters175592
Distinct characters42
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)< 0.1%

Sample

1st rowSciuridae
2nd rowCricetidae
3rd rowCricetidae
4th rowCricetidae
5th rowCricetidae
ValueCountFrequency (%)
cricetidae 4133
22.7%
soricidae 2286
 
12.6%
sciuridae 1673
 
9.2%
muridae 1068
 
5.9%
bovidae 837
 
4.6%
canidae 662
 
3.6%
dipodidae 459
 
2.5%
mustelidae 440
 
2.4%
cercopithecidae 421
 
2.3%
vespertilionidae 405
 
2.2%
Other values (124) 5798
31.9%
2025-01-08T18:33:13.952107image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 29346
16.7%
e 27872
15.9%
a 20776
11.8%
d 19527
11.1%
r 13293
7.6%
c 10178
 
5.8%
o 8753
 
5.0%
t 7151
 
4.1%
C 6000
 
3.4%
S 4067
 
2.3%
Other values (32) 28629
16.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 157410
89.6%
Uppercase Letter 18182
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 29346
18.6%
e 27872
17.7%
a 20776
13.2%
d 19527
12.4%
r 13293
8.4%
c 10178
 
6.5%
o 8753
 
5.6%
t 7151
 
4.5%
u 3652
 
2.3%
l 3087
 
2.0%
Other values (12) 13775
8.8%
Uppercase Letter
ValueCountFrequency (%)
C 6000
33.0%
S 4067
22.4%
M 1930
 
10.6%
P 1153
 
6.3%
D 866
 
4.8%
B 863
 
4.7%
H 485
 
2.7%
V 476
 
2.6%
L 448
 
2.5%
F 391
 
2.2%
Other values (10) 1503
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 175592
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 29346
16.7%
e 27872
15.9%
a 20776
11.8%
d 19527
11.1%
r 13293
7.6%
c 10178
 
5.8%
o 8753
 
5.0%
t 7151
 
4.1%
C 6000
 
3.4%
S 4067
 
2.3%
Other values (32) 28629
16.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 175592
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 29346
16.7%
e 27872
15.9%
a 20776
11.8%
d 19527
11.1%
r 13293
7.6%
c 10178
 
5.8%
o 8753
 
5.0%
t 7151
 
4.1%
C 6000
 
3.4%
S 4067
 
2.3%
Other values (32) 28629
16.3%

genus
Text

Missing 

Distinct610
Distinct (%)3.5%
Missing1248
Missing (%)6.6%
Memory size147.5 KiB
2025-01-08T18:33:14.145147image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length14
Mean length7.820581224
Min length3

Characters and Unicode

Total characters137783
Distinct characters48
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)0.6%

Sample

1st rowTamias
2nd rowPeromyscus
3rd rowPeromyscus
4th rowPeromyscus
5th rowPeromyscus
ValueCountFrequency (%)
peromyscus 1837
 
10.4%
sorex 1183
 
6.7%
blarina 976
 
5.5%
myodes 742
 
4.2%
ondatra 631
 
3.6%
microtus 430
 
2.4%
tamias 398
 
2.3%
napaeozapus 365
 
2.1%
canis 345
 
2.0%
procyon 329
 
1.9%
Other values (600) 10382
58.9%
2025-01-08T18:33:14.396327image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 15209
 
11.0%
o 12469
 
9.0%
a 11828
 
8.6%
r 10473
 
7.6%
e 9289
 
6.7%
u 9207
 
6.7%
i 7477
 
5.4%
c 6205
 
4.5%
y 5919
 
4.3%
t 5122
 
3.7%
Other values (38) 44585
32.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 120165
87.2%
Uppercase Letter 17618
 
12.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 15209
12.7%
o 12469
10.4%
a 11828
9.8%
r 10473
 
8.7%
e 9289
 
7.7%
u 9207
 
7.7%
i 7477
 
6.2%
c 6205
 
5.2%
y 5919
 
4.9%
t 5122
 
4.3%
Other values (15) 26967
22.4%
Uppercase Letter
ValueCountFrequency (%)
P 3036
17.2%
M 2506
14.2%
S 1903
10.8%
C 1609
9.1%
O 1280
7.3%
T 1205
 
6.8%
B 1189
 
6.7%
L 659
 
3.7%
N 645
 
3.7%
A 595
 
3.4%
Other values (13) 2991
17.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 137783
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 15209
 
11.0%
o 12469
 
9.0%
a 11828
 
8.6%
r 10473
 
7.6%
e 9289
 
6.7%
u 9207
 
6.7%
i 7477
 
5.4%
c 6205
 
4.5%
y 5919
 
4.3%
t 5122
 
3.7%
Other values (38) 44585
32.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 137783
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 15209
 
11.0%
o 12469
 
9.0%
a 11828
 
8.6%
r 10473
 
7.6%
e 9289
 
6.7%
u 9207
 
6.7%
i 7477
 
5.4%
c 6205
 
4.5%
y 5919
 
4.3%
t 5122
 
3.7%
Other values (38) 44585
32.4%

genericName
Text

Missing 

Distinct610
Distinct (%)3.5%
Missing1248
Missing (%)6.6%
Memory size147.5 KiB
2025-01-08T18:33:14.587019image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.097911227
Min length3

Characters and Unicode

Total characters142669
Distinct characters47
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)0.6%

Sample

1st rowTamias
2nd rowPeromyscus
3rd rowPeromyscus
4th rowPeromyscus
5th rowPeromyscus
ValueCountFrequency (%)
peromyscus 1837
 
10.4%
sorex 1193
 
6.8%
blarina 976
 
5.5%
clethrionomys 742
 
4.2%
ondatra 631
 
3.6%
microtus 434
 
2.5%
tamias 398
 
2.3%
napaeozapus 365
 
2.1%
canis 345
 
2.0%
procyon 329
 
1.9%
Other values (600) 10368
58.8%
2025-01-08T18:33:14.933268image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 15048
 
10.5%
o 13419
 
9.4%
a 11754
 
8.2%
r 11226
 
7.9%
e 9339
 
6.5%
u 9072
 
6.4%
i 8183
 
5.7%
c 6202
 
4.3%
y 5916
 
4.1%
m 5689
 
4.0%
Other values (37) 46821
32.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 125051
87.7%
Uppercase Letter 17618
 
12.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 15048
12.0%
o 13419
10.7%
a 11754
9.4%
r 11226
 
9.0%
e 9339
 
7.5%
u 9072
 
7.3%
i 8183
 
6.5%
c 6202
 
5.0%
y 5916
 
4.7%
m 5689
 
4.5%
Other values (14) 29203
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 3033
17.2%
C 2307
13.1%
S 1906
10.8%
M 1595
9.1%
O 1307
7.4%
T 1213
 
6.9%
B 1189
 
6.7%
N 830
 
4.7%
L 660
 
3.7%
A 584
 
3.3%
Other values (13) 2994
17.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 142669
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 15048
 
10.5%
o 13419
 
9.4%
a 11754
 
8.2%
r 11226
 
7.9%
e 9339
 
6.5%
u 9072
 
6.4%
i 8183
 
5.7%
c 6202
 
4.3%
y 5916
 
4.1%
m 5689
 
4.0%
Other values (37) 46821
32.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 142669
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 15048
 
10.5%
o 13419
 
9.4%
a 11754
 
8.2%
r 11226
 
7.9%
e 9339
 
6.5%
u 9072
 
6.4%
i 8183
 
5.7%
c 6202
 
4.3%
y 5916
 
4.1%
m 5689
 
4.0%
Other values (37) 46821
32.8%

specificEpithet
Text

Missing 

Distinct949
Distinct (%)5.8%
Missing2554
Missing (%)13.5%
Memory size147.5 KiB
2025-01-08T18:33:15.114624image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length14
Mean length8.579144188
Min length2

Characters and Unicode

Total characters139943
Distinct characters27
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)1.4%

Sample

1st rowstriatus
2nd rowleucopus
3rd rowleucopus
4th rowleucopus
5th rowleucopus
ValueCountFrequency (%)
brevicauda 986
 
6.0%
leucopus 775
 
4.8%
cinereus 747
 
4.6%
gapperi 708
 
4.3%
maniculatus 683
 
4.2%
zibethicus 631
 
3.9%
insignis 365
 
2.2%
lotor 326
 
2.0%
canadensis 320
 
2.0%
hudsonicus 292
 
1.8%
Other values (939) 10479
64.2%
2025-01-08T18:33:15.357093image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 16095
11.5%
s 14920
10.7%
u 14661
10.5%
a 13492
9.6%
e 10462
 
7.5%
n 9056
 
6.5%
r 8917
 
6.4%
c 8678
 
6.2%
l 6251
 
4.5%
t 5825
 
4.2%
Other values (17) 31586
22.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 139941
> 99.9%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 16095
11.5%
s 14920
10.7%
u 14661
10.5%
a 13492
9.6%
e 10462
 
7.5%
n 9056
 
6.5%
r 8917
 
6.4%
c 8678
 
6.2%
l 6251
 
4.5%
t 5825
 
4.2%
Other values (16) 31584
22.6%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 139941
> 99.9%
Common 2
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 16095
11.5%
s 14920
10.7%
u 14661
10.5%
a 13492
9.6%
e 10462
 
7.5%
n 9056
 
6.5%
r 8917
 
6.4%
c 8678
 
6.2%
l 6251
 
4.5%
t 5825
 
4.2%
Other values (16) 31584
22.6%
Common
ValueCountFrequency (%)
- 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 139943
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 16095
11.5%
s 14920
10.7%
u 14661
10.5%
a 13492
9.6%
e 10462
 
7.5%
n 9056
 
6.5%
r 8917
 
6.4%
c 8678
 
6.2%
l 6251
 
4.5%
t 5825
 
4.2%
Other values (17) 31586
22.6%

infraspecificEpithet
Text

Missing 

Distinct583
Distinct (%)8.1%
Missing11638
Missing (%)61.7%
Memory size147.5 KiB
2025-01-08T18:33:15.521467image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length14
Mean length8.710154953
Min length3

Characters and Unicode

Total characters62957
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique203 ?
Unique (%)2.8%

Sample

1st rowfisheri
2nd rowdomesticus
3rd rowdomesticus
4th rowdomesticus
5th rowdomesticus
ValueCountFrequency (%)
talpoides 835
 
11.6%
cinereus 743
 
10.3%
pennsylvanicus 303
 
4.2%
fumeus 275
 
3.8%
zibethicus 267
 
3.7%
domesticus 193
 
2.7%
lucifugus 155
 
2.1%
maniculatus 146
 
2.0%
brevicauda 138
 
1.9%
fulvus 119
 
1.6%
Other values (573) 4054
56.1%
2025-01-08T18:33:15.744886image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 7421
11.8%
i 6875
10.9%
e 6109
9.7%
u 5693
9.0%
a 5422
 
8.6%
n 4224
 
6.7%
c 3632
 
5.8%
l 3386
 
5.4%
r 3314
 
5.3%
t 3035
 
4.8%
Other values (16) 13846
22.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 62957
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 7421
11.8%
i 6875
10.9%
e 6109
9.7%
u 5693
9.0%
a 5422
 
8.6%
n 4224
 
6.7%
c 3632
 
5.8%
l 3386
 
5.4%
r 3314
 
5.3%
t 3035
 
4.8%
Other values (16) 13846
22.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 62957
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 7421
11.8%
i 6875
10.9%
e 6109
9.7%
u 5693
9.0%
a 5422
 
8.6%
n 4224
 
6.7%
c 3632
 
5.8%
l 3386
 
5.4%
r 3314
 
5.3%
t 3035
 
4.8%
Other values (16) 13846
22.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62957
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 7421
11.8%
i 6875
10.9%
e 6109
9.7%
u 5693
9.0%
a 5422
 
8.6%
n 4224
 
6.7%
c 3632
 
5.8%
l 3386
 
5.4%
r 3314
 
5.3%
t 3035
 
4.8%
Other values (16) 13846
22.0%
Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:15.803888image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length7
Mean length7.924732323
Min length5

Characters and Unicode

Total characters149508
Distinct characters19
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSUBSPECIES
2nd rowSPECIES
3rd rowSPECIES
4th rowSPECIES
5th rowSPECIES
ValueCountFrequency (%)
species 9084
48.2%
subspecies 7228
38.3%
genus 1306
 
6.9%
family 564
 
3.0%
order 282
 
1.5%
class 248
 
1.3%
kingdom 152
 
0.8%
phylum 2
 
< 0.1%
2025-01-08T18:33:15.907212image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
S 41654
27.9%
E 34212
22.9%
I 17028
11.4%
C 16560
 
11.1%
P 16314
 
10.9%
U 8536
 
5.7%
B 7228
 
4.8%
G 1458
 
1.0%
N 1458
 
1.0%
L 814
 
0.5%
Other values (9) 4246
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 149508
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 41654
27.9%
E 34212
22.9%
I 17028
11.4%
C 16560
 
11.1%
P 16314
 
10.9%
U 8536
 
5.7%
B 7228
 
4.8%
G 1458
 
1.0%
N 1458
 
1.0%
L 814
 
0.5%
Other values (9) 4246
 
2.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 149508
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 41654
27.9%
E 34212
22.9%
I 17028
11.4%
C 16560
 
11.1%
P 16314
 
10.9%
U 8536
 
5.7%
B 7228
 
4.8%
G 1458
 
1.0%
N 1458
 
1.0%
L 814
 
0.5%
Other values (9) 4246
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149508
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S 41654
27.9%
E 34212
22.9%
I 17028
11.4%
C 16560
 
11.1%
P 16314
 
10.9%
U 8536
 
5.7%
B 7228
 
4.8%
G 1458
 
1.0%
N 1458
 
1.0%
L 814
 
0.5%
Other values (9) 4246
 
2.8%
Distinct1166
Distinct (%)6.2%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:16.084841image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length143
Median length121
Mean length82.7007428
Min length31

Characters and Unicode

Total characters1547579
Distinct characters60
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique294 ?
Unique (%)1.6%

Sample

1st rowEastern Chipmunk; chipmunks; squirrels; rodents; mammals; vertebrates; chordates; animals
2nd rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
3rd rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
4th rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
5th rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
ValueCountFrequency (%)
mammals 18748
 
11.1%
vertebrates 18713
 
11.1%
chordates 18713
 
11.1%
animals 18713
 
11.1%
rodents 8561
 
5.1%
mice 7296
 
4.3%
carnivores 4733
 
2.8%
shrews 3336
 
2.0%
mouse 2787
 
1.7%
squirrels 2585
 
1.5%
Other values (1028) 64018
38.1%
2025-01-08T18:33:16.339580image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 152163
 
9.8%
e 149527
 
9.7%
149490
 
9.7%
s 133715
 
8.6%
; 118068
 
7.6%
r 116349
 
7.5%
t 95576
 
6.2%
m 94542
 
6.1%
o 69125
 
4.5%
l 60718
 
3.9%
Other values (50) 408306
26.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1230864
79.5%
Space Separator 149490
 
9.7%
Other Punctuation 118585
 
7.7%
Uppercase Letter 39691
 
2.6%
Dash Punctuation 8820
 
0.6%
Final Punctuation 128
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 152163
12.4%
e 149527
12.1%
s 133715
10.9%
r 116349
9.5%
t 95576
 
7.8%
m 94542
 
7.7%
o 69125
 
5.6%
l 60718
 
4.9%
i 59895
 
4.9%
n 53167
 
4.3%
Other values (17) 246087
20.0%
Uppercase Letter
ValueCountFrequency (%)
S 6525
16.4%
M 5563
14.0%
W 3144
 
7.9%
R 2708
 
6.8%
B 2683
 
6.8%
A 2449
 
6.2%
N 2218
 
5.6%
C 1850
 
4.7%
G 1827
 
4.6%
V 1298
 
3.3%
Other values (15) 9426
23.7%
Other Punctuation
ValueCountFrequency (%)
; 118068
99.6%
' 509
 
0.4%
. 4
 
< 0.1%
? 4
 
< 0.1%
Space Separator
ValueCountFrequency (%)
149490
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8820
100.0%
Final Punctuation
ValueCountFrequency (%)
128
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1270555
82.1%
Common 277024
 
17.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 152163
12.0%
e 149527
11.8%
s 133715
10.5%
r 116349
9.2%
t 95576
 
7.5%
m 94542
 
7.4%
o 69125
 
5.4%
l 60718
 
4.8%
i 59895
 
4.7%
n 53167
 
4.2%
Other values (42) 285778
22.5%
Common
ValueCountFrequency (%)
149490
54.0%
; 118068
42.6%
- 8820
 
3.2%
' 509
 
0.2%
128
 
< 0.1%
. 4
 
< 0.1%
? 4
 
< 0.1%
 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1547439
> 99.9%
Punctuation 128
 
< 0.1%
None 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 152163
 
9.8%
e 149527
 
9.7%
149490
 
9.7%
s 133715
 
8.6%
; 118068
 
7.6%
r 116349
 
7.5%
t 95576
 
6.2%
m 94542
 
6.1%
o 69125
 
4.5%
l 60718
 
3.9%
Other values (47) 408166
26.4%
Punctuation
ValueCountFrequency (%)
128
100.0%
None
ValueCountFrequency (%)
ü 11
91.7%
 1
 
8.3%

nomenclaturalCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:16.391579image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters75464
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowICZN
2nd rowICZN
3rd rowICZN
4th rowICZN
5th rowICZN
ValueCountFrequency (%)
iczn 18866
100.0%
2025-01-08T18:33:16.483586image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 75464
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 75464
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75464
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%
Distinct3
Distinct (%)< 0.1%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:16.527791image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.848722881
Min length7

Characters and Unicode

Total characters146881
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowACCEPTED
2nd rowACCEPTED
3rd rowACCEPTED
4th rowACCEPTED
5th rowACCEPTED
ValueCountFrequency (%)
accepted 15796
84.4%
synonym 2831
 
15.1%
doubtful 87
 
0.5%
2025-01-08T18:33:16.624476image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 31592
21.5%
E 31592
21.5%
T 15883
10.8%
D 15883
10.8%
A 15796
10.8%
P 15796
10.8%
Y 5662
 
3.9%
N 5662
 
3.9%
O 2918
 
2.0%
S 2831
 
1.9%
Other values (5) 3266
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 146881
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 31592
21.5%
E 31592
21.5%
T 15883
10.8%
D 15883
10.8%
A 15796
10.8%
P 15796
10.8%
Y 5662
 
3.9%
N 5662
 
3.9%
O 2918
 
2.0%
S 2831
 
1.9%
Other values (5) 3266
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 146881
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 31592
21.5%
E 31592
21.5%
T 15883
10.8%
D 15883
10.8%
A 15796
10.8%
P 15796
10.8%
Y 5662
 
3.9%
N 5662
 
3.9%
O 2918
 
2.0%
S 2831
 
1.9%
Other values (5) 3266
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 146881
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 31592
21.5%
E 31592
21.5%
T 15883
10.8%
D 15883
10.8%
A 15796
10.8%
P 15796
10.8%
Y 5662
 
3.9%
N 5662
 
3.9%
O 2918
 
2.0%
S 2831
 
1.9%
Other values (5) 3266
 
2.2%

taxonRemarks
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:16.671325image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length41
Median length41
Mean length41
Min length41

Characters and Unicode

Total characters773506
Distinct characters18
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimals and Plants: Vertebrates - Mammals
2nd rowAnimals and Plants: Vertebrates - Mammals
3rd rowAnimals and Plants: Vertebrates - Mammals
4th rowAnimals and Plants: Vertebrates - Mammals
5th rowAnimals and Plants: Vertebrates - Mammals
ValueCountFrequency (%)
animals 18866
16.7%
and 18866
16.7%
plants 18866
16.7%
vertebrates 18866
16.7%
18866
16.7%
mammals 18866
16.7%
2025-01-08T18:33:16.774348image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 113196
14.6%
94330
12.2%
s 75464
9.8%
e 56598
 
7.3%
m 56598
 
7.3%
l 56598
 
7.3%
n 56598
 
7.3%
t 56598
 
7.3%
r 37732
 
4.9%
A 18866
 
2.4%
Other values (8) 150928
19.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 565980
73.2%
Space Separator 94330
 
12.2%
Uppercase Letter 75464
 
9.8%
Dash Punctuation 18866
 
2.4%
Other Punctuation 18866
 
2.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 113196
20.0%
s 75464
13.3%
e 56598
10.0%
m 56598
10.0%
l 56598
10.0%
n 56598
10.0%
t 56598
10.0%
r 37732
 
6.7%
b 18866
 
3.3%
d 18866
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
A 18866
25.0%
P 18866
25.0%
V 18866
25.0%
M 18866
25.0%
Space Separator
ValueCountFrequency (%)
94330
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18866
100.0%
Other Punctuation
ValueCountFrequency (%)
: 18866
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 641444
82.9%
Common 132062
 
17.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 113196
17.6%
s 75464
11.8%
e 56598
8.8%
m 56598
8.8%
l 56598
8.8%
n 56598
8.8%
t 56598
8.8%
r 37732
 
5.9%
A 18866
 
2.9%
b 18866
 
2.9%
Other values (5) 94330
14.7%
Common
ValueCountFrequency (%)
94330
71.4%
- 18866
 
14.3%
: 18866
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 773506
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 113196
14.6%
94330
12.2%
s 75464
9.8%
e 56598
 
7.3%
m 56598
 
7.3%
l 56598
 
7.3%
n 56598
 
7.3%
t 56598
 
7.3%
r 37732
 
4.9%
A 18866
 
2.4%
Other values (8) 150928
19.5%

datasetKey
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:16.825067image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length36
Median length36
Mean length36
Min length36

Characters and Unicode

Total characters679176
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row854f602e-f762-11e1-a439-00145eb45e9a
2nd row854f602e-f762-11e1-a439-00145eb45e9a
3rd row854f602e-f762-11e1-a439-00145eb45e9a
4th row854f602e-f762-11e1-a439-00145eb45e9a
5th row854f602e-f762-11e1-a439-00145eb45e9a
ValueCountFrequency (%)
854f602e-f762-11e1-a439-00145eb45e9a 18866
100.0%
2025-01-08T18:33:16.929972image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 75464
11.1%
e 75464
11.1%
- 75464
11.1%
1 75464
11.1%
5 56598
8.3%
0 56598
8.3%
f 37732
 
5.6%
6 37732
 
5.6%
2 37732
 
5.6%
a 37732
 
5.6%
Other values (5) 113196
16.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 433918
63.9%
Lowercase Letter 169794
 
25.0%
Dash Punctuation 75464
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 75464
17.4%
1 75464
17.4%
5 56598
13.0%
0 56598
13.0%
6 37732
8.7%
2 37732
8.7%
9 37732
8.7%
8 18866
 
4.3%
7 18866
 
4.3%
3 18866
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
e 75464
44.4%
f 37732
22.2%
a 37732
22.2%
b 18866
 
11.1%
Dash Punctuation
ValueCountFrequency (%)
- 75464
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 509382
75.0%
Latin 169794
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 75464
14.8%
- 75464
14.8%
1 75464
14.8%
5 56598
11.1%
0 56598
11.1%
6 37732
7.4%
2 37732
7.4%
9 37732
7.4%
8 18866
 
3.7%
7 18866
 
3.7%
Latin
ValueCountFrequency (%)
e 75464
44.4%
f 37732
22.2%
a 37732
22.2%
b 18866
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 679176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 75464
11.1%
e 75464
11.1%
- 75464
11.1%
1 75464
11.1%
5 56598
8.3%
0 56598
8.3%
f 37732
 
5.6%
6 37732
 
5.6%
2 37732
 
5.6%
a 37732
 
5.6%
Other values (5) 113196
16.7%

publishingCountry
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:16.968474image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters37732
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS
ValueCountFrequency (%)
us 18866
100.0%
2025-01-08T18:33:17.060374image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
U 18866
50.0%
S 18866
50.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 37732
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 18866
50.0%
S 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 37732
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 18866
50.0%
S 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37732
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
U 18866
50.0%
S 18866
50.0%
Distinct4639
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:17.152465image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length23.9970317
Min length20

Characters and Unicode

Total characters452728
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique411 ?
Unique (%)2.2%

Sample

1st row2025-01-08T13:41:37.071Z
2nd row2025-01-08T13:41:37.575Z
3rd row2025-01-08T13:41:36.570Z
4th row2025-01-08T13:41:33.336Z
5th row2025-01-08T13:41:31.987Z
ValueCountFrequency (%)
2025-01-08t13:41:34.959z 13
 
0.1%
2025-01-08t13:41:37.272z 12
 
0.1%
2025-01-08t13:41:35.416z 12
 
0.1%
2025-01-08t13:41:37.511z 11
 
0.1%
2025-01-08t13:41:37.284z 11
 
0.1%
2025-01-08t13:41:35.688z 11
 
0.1%
2025-01-08t13:41:34.086z 11
 
0.1%
2025-01-08t13:41:31.753z 11
 
0.1%
2025-01-08t13:41:34.469z 11
 
0.1%
2025-01-08t13:41:36.351z 10
 
0.1%
Other values (4629) 18753
99.4%
2025-01-08T18:33:17.318821image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 62891
13.9%
0 62226
13.7%
3 46125
10.2%
2 45206
10.0%
- 37732
8.3%
: 37732
8.3%
4 27889
6.2%
5 27713
6.1%
8 24504
 
5.4%
T 18866
 
4.2%
Other values (5) 61844
13.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 320680
70.8%
Other Punctuation 56584
 
12.5%
Dash Punctuation 37732
 
8.3%
Uppercase Letter 37732
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 62891
19.6%
0 62226
19.4%
3 46125
14.4%
2 45206
14.1%
4 27889
8.7%
5 27713
8.6%
8 24504
 
7.6%
6 9517
 
3.0%
7 9009
 
2.8%
9 5600
 
1.7%
Other Punctuation
ValueCountFrequency (%)
: 37732
66.7%
. 18852
33.3%
Uppercase Letter
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 414996
91.7%
Latin 37732
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 62891
15.2%
0 62226
15.0%
3 46125
11.1%
2 45206
10.9%
- 37732
9.1%
: 37732
9.1%
4 27889
6.7%
5 27713
6.7%
8 24504
 
5.9%
. 18852
 
4.5%
Other values (3) 24126
 
5.8%
Latin
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 452728
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 62891
13.9%
0 62226
13.7%
3 46125
10.2%
2 45206
10.0%
- 37732
8.3%
: 37732
8.3%
4 27889
6.2%
5 27713
6.1%
8 24504
 
5.4%
T 18866
 
4.2%
Other values (5) 61844
13.7%

elevation
Text

Missing 

Distinct156
Distinct (%)10.6%
Missing17391
Missing (%)92.2%
Memory size147.5 KiB
2025-01-08T18:33:17.424606image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.444067797
Min length3

Characters and Unicode

Total characters8030
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)2.6%

Sample

1st row61.0
2nd row61.0
3rd row638.0
4th row638.0
5th row1143.0
ValueCountFrequency (%)
1829.0 124
 
8.4%
61.0 104
 
7.1%
2896.0 60
 
4.1%
700.0 59
 
4.0%
2134.0 59
 
4.0%
638.0 56
 
3.8%
1000.0 53
 
3.6%
500.0 42
 
2.8%
1402.0 29
 
2.0%
1280.0 29
 
2.0%
Other values (146) 860
58.3%
2025-01-08T18:33:17.587355image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2400
29.9%
. 1475
18.4%
1 934
 
11.6%
2 633
 
7.9%
8 506
 
6.3%
6 445
 
5.5%
9 373
 
4.6%
3 369
 
4.6%
7 309
 
3.8%
5 294
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6555
81.6%
Other Punctuation 1475
 
18.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2400
36.6%
1 934
 
14.2%
2 633
 
9.7%
8 506
 
7.7%
6 445
 
6.8%
9 373
 
5.7%
3 369
 
5.6%
7 309
 
4.7%
5 294
 
4.5%
4 292
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 1475
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8030
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2400
29.9%
. 1475
18.4%
1 934
 
11.6%
2 633
 
7.9%
8 506
 
6.3%
6 445
 
5.5%
9 373
 
4.6%
3 369
 
4.6%
7 309
 
3.8%
5 294
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8030
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2400
29.9%
. 1475
18.4%
1 934
 
11.6%
2 633
 
7.9%
8 506
 
6.3%
6 445
 
5.5%
9 373
 
4.6%
3 369
 
4.6%
7 309
 
3.8%
5 294
 
3.7%

elevationAccuracy
Text

Missing 

Distinct2
Distinct (%)0.3%
Missing18082
Missing (%)95.8%
Memory size147.5 KiB
2025-01-08T18:33:17.635656image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.005102041
Min length3

Characters and Unicode

Total characters2356
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row0.0
ValueCountFrequency (%)
0.0 782
99.7%
152.5 2
 
0.3%
2025-01-08T18:33:17.732810image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1564
66.4%
. 784
33.3%
5 4
 
0.2%
1 2
 
0.1%
2 2
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1572
66.7%
Other Punctuation 784
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1564
99.5%
5 4
 
0.3%
1 2
 
0.1%
2 2
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 784
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2356
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1564
66.4%
. 784
33.3%
5 4
 
0.2%
1 2
 
0.1%
2 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2356
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1564
66.4%
. 784
33.3%
5 4
 
0.2%
1 2
 
0.1%
2 2
 
0.1%
Distinct14
Distinct (%)17.9%
Missing18788
Missing (%)99.6%
Memory size147.5 KiB
2025-01-08T18:33:17.788317image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length18
Median length17
Mean length15.24358974
Min length3

Characters and Unicode

Total characters1189
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)9.0%

Sample

1st row0.0
2nd row1678.9213806293344
3rd row0.0
4th row3317.269457389723
5th row4308.557461717021
ValueCountFrequency (%)
4308.557461717021 30
38.5%
1132.2847034170802 13
16.7%
0.0 12
 
15.4%
2569.2685781328946 9
 
11.5%
3322.3754451523614 3
 
3.8%
2427.113575024377 2
 
2.6%
4700.828968112741 2
 
2.6%
1678.9213806293344 1
 
1.3%
3317.269457389723 1
 
1.3%
2524.2049532876945 1
 
1.3%
Other values (4) 4
 
5.1%
2025-01-08T18:33:17.905326image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 166
14.0%
7 156
13.1%
0 138
11.6%
2 133
11.2%
4 120
10.1%
5 100
8.4%
3 98
8.2%
8 98
8.2%
. 78
6.6%
6 71
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1111
93.4%
Other Punctuation 78
 
6.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 166
14.9%
7 156
14.0%
0 138
12.4%
2 133
12.0%
4 120
10.8%
5 100
9.0%
3 98
8.8%
8 98
8.8%
6 71
6.4%
9 31
 
2.8%
Other Punctuation
ValueCountFrequency (%)
. 78
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1189
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 166
14.0%
7 156
13.1%
0 138
11.6%
2 133
11.2%
4 120
10.1%
5 100
8.4%
3 98
8.2%
8 98
8.2%
. 78
6.6%
6 71
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1189
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 166
14.0%
7 156
13.1%
0 138
11.6%
2 133
11.2%
4 120
10.1%
5 100
8.4%
3 98
8.2%
8 98
8.2%
. 78
6.6%
6 71
6.0%

issue
Text

Distinct46
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:17.965302image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length178
Median length72
Mean length79.85301601
Min length72

Characters and Unicode

Total characters1506507
Distinct characters28
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)< 0.1%

Sample

1st rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY
2nd rowTAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY
3rd rowTAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY
4th rowTAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY
5th rowTAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY
ValueCountFrequency (%)
occurrence_status_inferred_from_individual_count;institution_match_fuzzy 12822
68.0%
taxon_match_higherrank;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 3130
 
16.6%
coordinate_rounded;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 1102
 
5.8%
continent_coordinate_mismatch;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 347
 
1.8%
recorded_date_mismatch;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 299
 
1.6%
coordinate_reprojected;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 181
 
1.0%
taxon_match_none;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 152
 
0.8%
taxon_match_fuzzy;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 139
 
0.7%
coordinate_rounded;taxon_match_higherrank;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 130
 
0.7%
continent_derived_from_coordinates;occurrence_status_inferred_from_individual_count;institution_match_fuzzy 80
 
0.4%
Other values (36) 484
 
2.6%
2025-01-08T18:33:18.089111image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
T 144679
 
9.6%
_ 143839
 
9.5%
I 139261
 
9.2%
N 125769
 
8.3%
U 115218
 
7.6%
R 106323
 
7.1%
C 102478
 
6.8%
O 86554
 
5.7%
E 85787
 
5.7%
A 71280
 
4.7%
Other values (18) 385319
25.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1336919
88.7%
Connector Punctuation 143839
 
9.5%
Other Punctuation 25381
 
1.7%
Decimal Number 368
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 144679
10.8%
I 139261
10.4%
N 125769
9.4%
U 115218
 
8.6%
R 106323
 
8.0%
C 102478
 
7.7%
O 86554
 
6.5%
E 85787
 
6.4%
A 71280
 
5.3%
D 63686
 
4.8%
Other values (14) 295884
22.1%
Decimal Number
ValueCountFrequency (%)
8 184
50.0%
4 184
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 143839
100.0%
Other Punctuation
ValueCountFrequency (%)
; 25381
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1336919
88.7%
Common 169588
 
11.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 144679
10.8%
I 139261
10.4%
N 125769
9.4%
U 115218
 
8.6%
R 106323
 
8.0%
C 102478
 
7.7%
O 86554
 
6.5%
E 85787
 
6.4%
A 71280
 
5.3%
D 63686
 
4.8%
Other values (14) 295884
22.1%
Common
ValueCountFrequency (%)
_ 143839
84.8%
; 25381
 
15.0%
8 184
 
0.1%
4 184
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1506507
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
T 144679
 
9.6%
_ 143839
 
9.5%
I 139261
 
9.2%
N 125769
 
8.3%
U 115218
 
7.6%
R 106323
 
7.1%
C 102478
 
6.8%
O 86554
 
5.7%
E 85787
 
5.7%
A 71280
 
4.7%
Other values (18) 385319
25.6%

mediaType
Text

Constant  Missing 

Distinct1
Distinct (%)0.2%
Missing18411
Missing (%)97.6%
Memory size147.5 KiB
2025-01-08T18:33:18.132666image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters4550
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowStillImage
2nd rowStillImage
3rd rowStillImage
4th rowStillImage
5th rowStillImage
ValueCountFrequency (%)
stillimage 455
100.0%
2025-01-08T18:33:18.222170image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 910
20.0%
S 455
10.0%
t 455
10.0%
i 455
10.0%
I 455
10.0%
m 455
10.0%
a 455
10.0%
g 455
10.0%
e 455
10.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3640
80.0%
Uppercase Letter 910
 
20.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 910
25.0%
t 455
12.5%
i 455
12.5%
m 455
12.5%
a 455
12.5%
g 455
12.5%
e 455
12.5%
Uppercase Letter
ValueCountFrequency (%)
S 455
50.0%
I 455
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4550
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 910
20.0%
S 455
10.0%
t 455
10.0%
i 455
10.0%
I 455
10.0%
m 455
10.0%
a 455
10.0%
g 455
10.0%
e 455
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4550
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 910
20.0%
S 455
10.0%
t 455
10.0%
i 455
10.0%
I 455
10.0%
m 455
10.0%
a 455
10.0%
g 455
10.0%
e 455
10.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:18.264507image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.293808969
Min length4

Characters and Unicode

Total characters81007
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowfalse
2nd rowfalse
3rd rowfalse
4th rowfalse
5th rowfalse
ValueCountFrequency (%)
true 13323
70.6%
false 5543
29.4%
2025-01-08T18:33:18.355892image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 18866
23.3%
t 13323
16.4%
r 13323
16.4%
u 13323
16.4%
f 5543
 
6.8%
a 5543
 
6.8%
l 5543
 
6.8%
s 5543
 
6.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 81007
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 18866
23.3%
t 13323
16.4%
r 13323
16.4%
u 13323
16.4%
f 5543
 
6.8%
a 5543
 
6.8%
l 5543
 
6.8%
s 5543
 
6.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 81007
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 18866
23.3%
t 13323
16.4%
r 13323
16.4%
u 13323
16.4%
f 5543
 
6.8%
a 5543
 
6.8%
l 5543
 
6.8%
s 5543
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 81007
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 18866
23.3%
t 13323
16.4%
r 13323
16.4%
u 13323
16.4%
f 5543
 
6.8%
a 5543
 
6.8%
l 5543
 
6.8%
s 5543
 
6.8%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:18.396891image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.997985795
Min length4

Characters and Unicode

Total characters94292
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowfalse
2nd rowfalse
3rd rowfalse
4th rowfalse
5th rowfalse
ValueCountFrequency (%)
false 18828
99.8%
true 38
 
0.2%
2025-01-08T18:33:18.496862image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 18866
20.0%
f 18828
20.0%
a 18828
20.0%
l 18828
20.0%
s 18828
20.0%
t 38
 
< 0.1%
r 38
 
< 0.1%
u 38
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 94292
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 18866
20.0%
f 18828
20.0%
a 18828
20.0%
l 18828
20.0%
s 18828
20.0%
t 38
 
< 0.1%
r 38
 
< 0.1%
u 38
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 94292
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 18866
20.0%
f 18828
20.0%
a 18828
20.0%
l 18828
20.0%
s 18828
20.0%
t 38
 
< 0.1%
r 38
 
< 0.1%
u 38
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94292
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 18866
20.0%
f 18828
20.0%
a 18828
20.0%
l 18828
20.0%
s 18828
20.0%
t 38
 
< 0.1%
r 38
 
< 0.1%
u 38
 
< 0.1%
Distinct1854
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:18.686465image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.769055444
Min length1

Characters and Unicode

Total characters127705
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique618 ?
Unique (%)3.3%

Sample

1st row4263596
2nd row2438019
3rd row2438019
4th row2438019
5th row2438019
ValueCountFrequency (%)
6163288 835
 
4.4%
2438019 774
 
4.1%
7059215 728
 
3.9%
2439137 691
 
3.7%
2437967 459
 
2.4%
2439461 365
 
1.9%
5219858 364
 
1.9%
7194100 275
 
1.5%
6163538 267
 
1.4%
359 248
 
1.3%
Other values (1844) 13860
73.5%
2025-01-08T18:33:18.958628image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 18522
14.5%
4 16888
13.2%
3 15746
12.3%
1 14417
11.3%
9 12202
9.6%
6 11986
9.4%
7 10030
7.9%
5 9755
7.6%
8 9707
7.6%
0 8452
6.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 127705
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 18522
14.5%
4 16888
13.2%
3 15746
12.3%
1 14417
11.3%
9 12202
9.6%
6 11986
9.4%
7 10030
7.9%
5 9755
7.6%
8 9707
7.6%
0 8452
6.6%

Most occurring scripts

ValueCountFrequency (%)
Common 127705
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 18522
14.5%
4 16888
13.2%
3 15746
12.3%
1 14417
11.3%
9 12202
9.6%
6 11986
9.4%
7 10030
7.9%
5 9755
7.6%
8 9707
7.6%
0 8452
6.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 127705
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 18522
14.5%
4 16888
13.2%
3 15746
12.3%
1 14417
11.3%
9 12202
9.6%
6 11986
9.4%
7 10030
7.9%
5 9755
7.6%
8 9707
7.6%
0 8452
6.6%
Distinct1774
Distinct (%)9.5%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:19.165823image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.811424602
Min length2

Characters and Unicode

Total characters127469
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique580 ?
Unique (%)3.1%

Sample

1st row4263596
2nd row2438019
3rd row2438019
4th row2438019
5th row2438019
ValueCountFrequency (%)
6163288 835
 
4.5%
2438019 775
 
4.1%
7059215 728
 
3.9%
5706760 708
 
3.8%
5219858 631
 
3.4%
2437967 528
 
2.8%
2439461 365
 
2.0%
7194100 275
 
1.5%
2438655 259
 
1.4%
359 248
 
1.3%
Other values (1764) 13362
71.4%
2025-01-08T18:33:19.429270image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 18721
14.7%
4 16423
12.9%
3 13842
10.9%
1 13333
10.5%
6 13306
10.4%
9 11427
9.0%
7 10399
8.2%
5 10313
8.1%
8 10107
7.9%
0 9598
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 127469
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 18721
14.7%
4 16423
12.9%
3 13842
10.9%
1 13333
10.5%
6 13306
10.4%
9 11427
9.0%
7 10399
8.2%
5 10313
8.1%
8 10107
7.9%
0 9598
7.5%

Most occurring scripts

ValueCountFrequency (%)
Common 127469
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 18721
14.7%
4 16423
12.9%
3 13842
10.9%
1 13333
10.5%
6 13306
10.4%
9 11427
9.0%
7 10399
8.2%
5 10313
8.1%
8 10107
7.9%
0 9598
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 127469
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 18721
14.7%
4 16423
12.9%
3 13842
10.9%
1 13333
10.5%
6 13306
10.4%
9 11427
9.0%
7 10399
8.2%
5 10313
8.1%
8 10107
7.9%
0 9598
7.5%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:19.484776image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters18866
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
1 18714
99.2%
0 152
 
0.8%
2025-01-08T18:33:19.580779image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18714
99.2%
0 152
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18866
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 18714
99.2%
0 152
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common 18866
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 18714
99.2%
0 152
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18866
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18714
99.2%
0 152
 
0.8%

phylumKey
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:19.619779image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters37428
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44
2nd row44
3rd row44
4th row44
5th row44
ValueCountFrequency (%)
44 18714
100.0%
2025-01-08T18:33:19.709817image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 37428
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37428
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 37428
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37428
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 37428
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37428
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 37428
100.0%

classKey
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing154
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:19.751741image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56136
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row359
2nd row359
3rd row359
4th row359
5th row359
ValueCountFrequency (%)
359 18712
100.0%
2025-01-08T18:33:19.840265image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 18712
33.3%
5 18712
33.3%
9 18712
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 56136
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 18712
33.3%
5 18712
33.3%
9 18712
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 56136
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 18712
33.3%
5 18712
33.3%
9 18712
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56136
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 18712
33.3%
5 18712
33.3%
9 18712
33.3%

orderKey
Text

Missing 

Distinct27
Distinct (%)0.1%
Missing406
Missing (%)2.2%
Memory size147.5 KiB
2025-01-08T18:33:19.897268image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.474702059
Min length3

Characters and Unicode

Total characters64143
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1459
2nd row1459
3rd row1459
4th row1459
5th row1459
ValueCountFrequency (%)
1459 8426
45.6%
803 2476
 
13.4%
732 2371
 
12.8%
731 1529
 
8.3%
734 1102
 
6.0%
798 953
 
5.2%
785 348
 
1.9%
1452 248
 
1.3%
783 213
 
1.2%
795 157
 
0.9%
Other values (17) 637
 
3.5%
2025-01-08T18:33:20.005154image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 10349
16.1%
4 9983
15.6%
9 9799
15.3%
5 9346
14.6%
3 8072
12.6%
7 7095
11.1%
8 4213
6.6%
2 2730
 
4.3%
0 2511
 
3.9%
6 45
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 64143
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 10349
16.1%
4 9983
15.6%
9 9799
15.3%
5 9346
14.6%
3 8072
12.6%
7 7095
11.1%
8 4213
6.6%
2 2730
 
4.3%
0 2511
 
3.9%
6 45
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 64143
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 10349
16.1%
4 9983
15.6%
9 9799
15.3%
5 9346
14.6%
3 8072
12.6%
7 7095
11.1%
8 4213
6.6%
2 2730
 
4.3%
0 2511
 
3.9%
6 45
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 64143
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 10349
16.1%
4 9983
15.6%
9 9799
15.3%
5 9346
14.6%
3 8072
12.6%
7 7095
11.1%
8 4213
6.6%
2 2730
 
4.3%
0 2511
 
3.9%
6 45
 
0.1%

familyKey
Text

Missing 

Distinct134
Distinct (%)0.7%
Missing684
Missing (%)3.6%
Memory size147.5 KiB
2025-01-08T18:33:20.126386image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length4
Mean length4.745132549
Min length4

Characters and Unicode

Total characters86276
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)< 0.1%

Sample

1st row9456
2nd row3240723
3rd row3240723
4th row3240723
5th row3240723
ValueCountFrequency (%)
3240723 4133
22.7%
5534 2286
 
12.6%
9456 1673
 
9.2%
5510 1068
 
5.9%
9614 837
 
4.6%
9701 662
 
3.6%
9435 459
 
2.5%
5307 440
 
2.4%
9622 421
 
2.3%
9368 405
 
2.2%
Other values (124) 5798
31.9%
2025-01-08T18:33:20.314511image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 15416
17.9%
5 13473
15.6%
4 11897
13.8%
2 10515
12.2%
9 9211
10.7%
0 7550
8.8%
7 7183
8.3%
6 5300
 
6.1%
1 4201
 
4.9%
8 1530
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 86276
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 15416
17.9%
5 13473
15.6%
4 11897
13.8%
2 10515
12.2%
9 9211
10.7%
0 7550
8.8%
7 7183
8.3%
6 5300
 
6.1%
1 4201
 
4.9%
8 1530
 
1.8%

Most occurring scripts

ValueCountFrequency (%)
Common 86276
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 15416
17.9%
5 13473
15.6%
4 11897
13.8%
2 10515
12.2%
9 9211
10.7%
0 7550
8.8%
7 7183
8.3%
6 5300
 
6.1%
1 4201
 
4.9%
8 1530
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 86276
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 15416
17.9%
5 13473
15.6%
4 11897
13.8%
2 10515
12.2%
9 9211
10.7%
0 7550
8.8%
7 7183
8.3%
6 5300
 
6.1%
1 4201
 
4.9%
8 1530
 
1.8%

genusKey
Text

Missing 

Distinct612
Distinct (%)3.5%
Missing1248
Missing (%)6.6%
Memory size147.5 KiB
2025-01-08T18:33:20.512956image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length7
Mean length7.000908162
Min length7

Characters and Unicode

Total characters123342
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)0.6%

Sample

1st row2437422
2nd row2437961
3rd row2437961
4th row2437961
5th row2437961
ValueCountFrequency (%)
2437961 1837
 
10.4%
2435935 1183
 
6.7%
2435858 976
 
5.5%
2438724 742
 
4.2%
5219857 631
 
3.6%
2438591 430
 
2.4%
2437422 398
 
2.3%
2439460 365
 
2.1%
5219142 345
 
2.0%
2433592 329
 
1.9%
Other values (602) 10382
58.9%
2025-01-08T18:33:20.875165image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 24012
19.5%
4 21930
17.8%
3 20143
16.3%
5 11636
9.4%
9 10645
8.6%
7 8680
 
7.0%
8 8149
 
6.6%
1 7388
 
6.0%
6 6542
 
5.3%
0 4217
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 123342
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 24012
19.5%
4 21930
17.8%
3 20143
16.3%
5 11636
9.4%
9 10645
8.6%
7 8680
 
7.0%
8 8149
 
6.6%
1 7388
 
6.0%
6 6542
 
5.3%
0 4217
 
3.4%

Most occurring scripts

ValueCountFrequency (%)
Common 123342
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 24012
19.5%
4 21930
17.8%
3 20143
16.3%
5 11636
9.4%
9 10645
8.6%
7 8680
 
7.0%
8 8149
 
6.6%
1 7388
 
6.0%
6 6542
 
5.3%
0 4217
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 123342
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 24012
19.5%
4 21930
17.8%
3 20143
16.3%
5 11636
9.4%
9 10645
8.6%
7 8680
 
7.0%
8 8149
 
6.6%
1 7388
 
6.0%
6 6542
 
5.3%
0 4217
 
3.4%

speciesKey
Text

Missing 

Distinct1113
Distinct (%)6.8%
Missing2554
Missing (%)13.5%
Memory size147.5 KiB
2025-01-08T18:33:21.075855image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length7
Mean length7.003126533
Min length7

Characters and Unicode

Total characters114235
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)1.9%

Sample

1st row2437438
2nd row2438019
3rd row2438019
4th row2438019
5th row2438019
ValueCountFrequency (%)
2435862 975
 
6.0%
2438019 775
 
4.8%
2435964 739
 
4.5%
5706760 708
 
4.3%
2437967 683
 
4.2%
5219858 631
 
3.9%
2439461 365
 
2.2%
5218786 327
 
2.0%
2437282 292
 
1.8%
2435947 281
 
1.7%
Other values (1103) 10536
64.6%
2025-01-08T18:33:21.332253image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 20244
17.7%
4 17780
15.6%
3 14803
13.0%
5 10031
8.8%
6 9653
8.5%
8 9437
8.3%
9 9094
8.0%
7 8673
7.6%
1 7654
 
6.7%
0 6866
 
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 114235
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 20244
17.7%
4 17780
15.6%
3 14803
13.0%
5 10031
8.8%
6 9653
8.5%
8 9437
8.3%
9 9094
8.0%
7 8673
7.6%
1 7654
 
6.7%
0 6866
 
6.0%

Most occurring scripts

ValueCountFrequency (%)
Common 114235
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 20244
17.7%
4 17780
15.6%
3 14803
13.0%
5 10031
8.8%
6 9653
8.5%
8 9437
8.3%
9 9094
8.0%
7 8673
7.6%
1 7654
 
6.7%
0 6866
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 114235
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 20244
17.7%
4 17780
15.6%
3 14803
13.0%
5 10031
8.8%
6 9653
8.5%
8 9437
8.3%
9 9094
8.0%
7 8673
7.6%
1 7654
 
6.7%
0 6866
 
6.0%

species
Text

Missing 

Distinct1112
Distinct (%)6.8%
Missing2554
Missing (%)13.5%
Memory size147.5 KiB
2025-01-08T18:33:21.532127image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length29
Median length25
Mean length17.32203286
Min length9

Characters and Unicode

Total characters282557
Distinct characters50
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)1.9%

Sample

1st rowTamias striatus
2nd rowPeromyscus leucopus
3rd rowPeromyscus leucopus
4th rowPeromyscus leucopus
5th rowPeromyscus leucopus
ValueCountFrequency (%)
peromyscus 1633
 
5.0%
sorex 1176
 
3.6%
brevicauda 986
 
3.0%
blarina 976
 
3.0%
leucopus 775
 
2.4%
cinereus 747
 
2.3%
myodes 742
 
2.3%
gapperi 708
 
2.2%
maniculatus 683
 
2.1%
ondatra 631
 
1.9%
Other values (1485) 23567
72.2%
2025-01-08T18:33:21.792718image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 29016
 
10.3%
a 24276
 
8.6%
u 23247
 
8.2%
i 22681
 
8.0%
e 18844
 
6.7%
r 18410
 
6.5%
o 17053
 
6.0%
16312
 
5.8%
c 14413
 
5.1%
n 13336
 
4.7%
Other values (40) 84969
30.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 249933
88.5%
Space Separator 16312
 
5.8%
Uppercase Letter 16312
 
5.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 29016
11.6%
a 24276
9.7%
u 23247
9.3%
i 22681
 
9.1%
e 18844
 
7.5%
r 18410
 
7.4%
o 17053
 
6.8%
c 14413
 
5.8%
n 13336
 
5.3%
l 10927
 
4.4%
Other values (16) 57730
23.1%
Uppercase Letter
ValueCountFrequency (%)
P 2710
16.6%
M 2200
13.5%
S 1866
11.4%
C 1481
9.1%
O 1191
7.3%
B 1187
7.3%
T 1149
7.0%
N 619
 
3.8%
A 577
 
3.5%
D 533
 
3.3%
Other values (13) 2799
17.2%
Space Separator
ValueCountFrequency (%)
16312
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 266245
94.2%
Common 16312
 
5.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 29016
 
10.9%
a 24276
 
9.1%
u 23247
 
8.7%
i 22681
 
8.5%
e 18844
 
7.1%
r 18410
 
6.9%
o 17053
 
6.4%
c 14413
 
5.4%
n 13336
 
5.0%
l 10927
 
4.1%
Other values (39) 74042
27.8%
Common
ValueCountFrequency (%)
16312
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 282557
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 29016
 
10.3%
a 24276
 
8.6%
u 23247
 
8.2%
i 22681
 
8.0%
e 18844
 
6.7%
r 18410
 
6.5%
o 17053
 
6.0%
16312
 
5.8%
c 14413
 
5.1%
n 13336
 
4.7%
Other values (40) 84969
30.1%
Distinct1774
Distinct (%)9.5%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-08T18:33:21.988604image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length110
Median length58
Mean length31.81564604
Min length6

Characters and Unicode

Total characters595398
Distinct characters75
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique580 ?
Unique (%)3.1%

Sample

1st rowTamias striatus fisheri A.H.Howell, 1925
2nd rowPeromyscus leucopus (Rafinesque, 1818)
3rd rowPeromyscus leucopus (Rafinesque, 1818)
4th rowPeromyscus leucopus (Rafinesque, 1818)
5th rowPeromyscus leucopus (Rafinesque, 1818)
ValueCountFrequency (%)
linnaeus 2733
 
3.8%
1758 1977
 
2.8%
peromyscus 1837
 
2.6%
1830 1587
 
2.2%
cinereus 1487
 
2.1%
sorex 1183
 
1.6%
brevicauda 1124
 
1.6%
blarina 976
 
1.4%
talpoides 867
 
1.2%
1766 860
 
1.2%
Other values (2410) 57206
79.6%
2025-01-08T18:33:22.249862image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53123
 
8.9%
s 43421
 
7.3%
a 41483
 
7.0%
e 39164
 
6.6%
i 38084
 
6.4%
u 33788
 
5.7%
r 31615
 
5.3%
n 27100
 
4.6%
o 25118
 
4.2%
l 19859
 
3.3%
Other values (65) 242643
40.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 410635
69.0%
Decimal Number 58980
 
9.9%
Space Separator 53123
 
8.9%
Uppercase Letter 36255
 
6.1%
Other Punctuation 17087
 
2.9%
Open Punctuation 9556
 
1.6%
Close Punctuation 9556
 
1.6%
Dash Punctuation 206
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 43421
10.6%
a 41483
10.1%
e 39164
9.5%
i 38084
9.3%
u 33788
 
8.2%
r 31615
 
7.7%
n 27100
 
6.6%
o 25118
 
6.1%
l 19859
 
4.8%
c 19094
 
4.6%
Other values (20) 91909
22.4%
Uppercase Letter
ValueCountFrequency (%)
M 3933
10.8%
L 3788
10.4%
P 3612
 
10.0%
S 3105
 
8.6%
G 2600
 
7.2%
B 2447
 
6.7%
C 2194
 
6.1%
O 1923
 
5.3%
T 1717
 
4.7%
A 1619
 
4.5%
Other values (17) 9317
25.7%
Decimal Number
ValueCountFrequency (%)
1 17754
30.1%
8 12985
22.0%
7 6043
 
10.2%
5 4184
 
7.1%
9 3974
 
6.7%
0 3867
 
6.6%
3 3367
 
5.7%
6 2805
 
4.8%
2 2295
 
3.9%
4 1706
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 14845
86.9%
. 1934
 
11.3%
& 303
 
1.8%
' 5
 
< 0.1%
Space Separator
ValueCountFrequency (%)
53123
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9556
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9556
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 206
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 446890
75.1%
Common 148508
 
24.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 43421
 
9.7%
a 41483
 
9.3%
e 39164
 
8.8%
i 38084
 
8.5%
u 33788
 
7.6%
r 31615
 
7.1%
n 27100
 
6.1%
o 25118
 
5.6%
l 19859
 
4.4%
c 19094
 
4.3%
Other values (47) 128164
28.7%
Common
ValueCountFrequency (%)
53123
35.8%
1 17754
 
12.0%
, 14845
 
10.0%
8 12985
 
8.7%
( 9556
 
6.4%
) 9556
 
6.4%
7 6043
 
4.1%
5 4184
 
2.8%
9 3974
 
2.7%
0 3867
 
2.6%
Other values (8) 12621
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 594814
99.9%
None 584
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53123
 
8.9%
s 43421
 
7.3%
a 41483
 
7.0%
e 39164
 
6.6%
i 38084
 
6.4%
u 33788
 
5.7%
r 31615
 
5.3%
n 27100
 
4.6%
o 25118
 
4.2%
l 19859
 
3.3%
Other values (60) 242059
40.7%
None
ValueCountFrequency (%)
É 384
65.8%
ü 133
 
22.8%
è 28
 
4.8%
é 25
 
4.3%
ö 14
 
2.4%
Distinct2018
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:22.444272image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length43
Median length34
Mean length22.09201739
Min length3

Characters and Unicode

Total characters416788
Distinct characters53
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique703 ?
Unique (%)3.7%

Sample

1st rowTamias striatus fisheri
2nd rowPeromyscus leucopus noveboracensis
3rd rowPeromyscus leucopus noveboracensis
4th rowPeromyscus leucopus noveboracensis
5th rowPeromyscus leucopus noveboracensis
ValueCountFrequency (%)
peromyscus 1837
 
4.0%
cinereus 1489
 
3.2%
sorex 1193
 
2.6%
brevicauda 1125
 
2.4%
blarina 976
 
2.1%
zibethicus 898
 
2.0%
talpoides 868
 
1.9%
gapperi 848
 
1.8%
maniculatus 829
 
1.8%
leucopus 782
 
1.7%
Other values (2070) 35113
76.4%
2025-01-08T18:33:22.711049image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 41623
 
10.0%
i 36625
 
8.8%
a 35093
 
8.4%
u 30890
 
7.4%
e 30381
 
7.3%
27092
 
6.5%
r 26522
 
6.4%
o 25267
 
6.1%
n 22452
 
5.4%
c 20781
 
5.0%
Other values (43) 120062
28.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 370969
89.0%
Space Separator 27092
 
6.5%
Uppercase Letter 18716
 
4.5%
Other Punctuation 9
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 41623
11.2%
i 36625
9.9%
a 35093
9.5%
u 30890
 
8.3%
e 30381
 
8.2%
r 26522
 
7.1%
o 25267
 
6.8%
n 22452
 
6.1%
c 20781
 
5.6%
l 16432
 
4.4%
Other values (16) 84903
22.9%
Uppercase Letter
ValueCountFrequency (%)
P 3107
16.6%
C 2505
13.4%
S 1952
10.4%
M 1925
10.3%
B 1452
7.8%
O 1312
7.0%
T 1217
 
6.5%
N 831
 
4.4%
L 676
 
3.6%
A 598
 
3.2%
Other values (13) 3141
16.8%
Other Punctuation
ValueCountFrequency (%)
. 7
77.8%
? 2
 
22.2%
Space Separator
ValueCountFrequency (%)
27092
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 389685
93.5%
Common 27103
 
6.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 41623
10.7%
i 36625
 
9.4%
a 35093
 
9.0%
u 30890
 
7.9%
e 30381
 
7.8%
r 26522
 
6.8%
o 25267
 
6.5%
n 22452
 
5.8%
c 20781
 
5.3%
l 16432
 
4.2%
Other values (39) 103619
26.6%
Common
ValueCountFrequency (%)
27092
> 99.9%
. 7
 
< 0.1%
? 2
 
< 0.1%
- 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 416788
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 41623
 
10.0%
i 36625
 
8.8%
a 35093
 
8.4%
u 30890
 
7.4%
e 30381
 
7.3%
27092
 
6.5%
r 26522
 
6.4%
o 25267
 
6.1%
n 22452
 
5.4%
c 20781
 
5.0%
Other values (43) 120062
28.8%

protocol
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:22.763193image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56598
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEML
2nd rowEML
3rd rowEML
4th rowEML
5th rowEML
ValueCountFrequency (%)
eml 18866
100.0%
2025-01-08T18:33:22.856671image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 18866
33.3%
M 18866
33.3%
L 18866
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 56598
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 18866
33.3%
M 18866
33.3%
L 18866
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56598
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 18866
33.3%
M 18866
33.3%
L 18866
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56598
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 18866
33.3%
M 18866
33.3%
L 18866
33.3%
Distinct4639
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:22.948115image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length23.9970317
Min length20

Characters and Unicode

Total characters452728
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique411 ?
Unique (%)2.2%

Sample

1st row2025-01-08T13:41:37.071Z
2nd row2025-01-08T13:41:37.575Z
3rd row2025-01-08T13:41:36.570Z
4th row2025-01-08T13:41:33.336Z
5th row2025-01-08T13:41:31.987Z
ValueCountFrequency (%)
2025-01-08t13:41:34.959z 13
 
0.1%
2025-01-08t13:41:37.272z 12
 
0.1%
2025-01-08t13:41:35.416z 12
 
0.1%
2025-01-08t13:41:37.511z 11
 
0.1%
2025-01-08t13:41:37.284z 11
 
0.1%
2025-01-08t13:41:35.688z 11
 
0.1%
2025-01-08t13:41:34.086z 11
 
0.1%
2025-01-08t13:41:31.753z 11
 
0.1%
2025-01-08t13:41:34.469z 11
 
0.1%
2025-01-08t13:41:36.351z 10
 
0.1%
Other values (4629) 18753
99.4%
2025-01-08T18:33:23.118175image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 62891
13.9%
0 62226
13.7%
3 46125
10.2%
2 45206
10.0%
- 37732
8.3%
: 37732
8.3%
4 27889
6.2%
5 27713
6.1%
8 24504
 
5.4%
T 18866
 
4.2%
Other values (5) 61844
13.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 320680
70.8%
Other Punctuation 56584
 
12.5%
Dash Punctuation 37732
 
8.3%
Uppercase Letter 37732
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 62891
19.6%
0 62226
19.4%
3 46125
14.4%
2 45206
14.1%
4 27889
8.7%
5 27713
8.6%
8 24504
 
7.6%
6 9517
 
3.0%
7 9009
 
2.8%
9 5600
 
1.7%
Other Punctuation
ValueCountFrequency (%)
: 37732
66.7%
. 18852
33.3%
Uppercase Letter
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 414996
91.7%
Latin 37732
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 62891
15.2%
0 62226
15.0%
3 46125
11.1%
2 45206
10.9%
- 37732
9.1%
: 37732
9.1%
4 27889
6.7%
5 27713
6.7%
8 24504
 
5.9%
. 18852
 
4.5%
Other values (3) 24126
 
5.8%
Latin
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 452728
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 62891
13.9%
0 62226
13.7%
3 46125
10.2%
2 45206
10.0%
- 37732
8.3%
: 37732
8.3%
4 27889
6.2%
5 27713
6.1%
8 24504
 
5.4%
T 18866
 
4.2%
Other values (5) 61844
13.7%

lastCrawled
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:23.178106image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters452784
Distinct characters12
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2025-01-08T13:41:11.140Z
2nd row2025-01-08T13:41:11.140Z
3rd row2025-01-08T13:41:11.140Z
4th row2025-01-08T13:41:11.140Z
5th row2025-01-08T13:41:11.140Z
ValueCountFrequency (%)
2025-01-08t13:41:11.140z 18866
100.0%
2025-01-08T18:33:23.280649image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 113196
25.0%
0 75464
16.7%
2 37732
 
8.3%
- 37732
 
8.3%
: 37732
 
8.3%
4 37732
 
8.3%
5 18866
 
4.2%
8 18866
 
4.2%
T 18866
 
4.2%
3 18866
 
4.2%
Other values (2) 37732
 
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 320722
70.8%
Other Punctuation 56598
 
12.5%
Dash Punctuation 37732
 
8.3%
Uppercase Letter 37732
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 113196
35.3%
0 75464
23.5%
2 37732
 
11.8%
4 37732
 
11.8%
5 18866
 
5.9%
8 18866
 
5.9%
3 18866
 
5.9%
Other Punctuation
ValueCountFrequency (%)
: 37732
66.7%
. 18866
33.3%
Uppercase Letter
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 415052
91.7%
Latin 37732
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 113196
27.3%
0 75464
18.2%
2 37732
 
9.1%
- 37732
 
9.1%
: 37732
 
9.1%
4 37732
 
9.1%
5 18866
 
4.5%
8 18866
 
4.5%
3 18866
 
4.5%
. 18866
 
4.5%
Latin
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 452784
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 113196
25.0%
0 75464
16.7%
2 37732
 
8.3%
- 37732
 
8.3%
: 37732
 
8.3%
4 37732
 
8.3%
5 18866
 
4.2%
8 18866
 
4.2%
T 18866
 
4.2%
3 18866
 
4.2%
Other values (2) 37732
 
8.3%

repatriated
Text

Missing 

Distinct2
Distinct (%)< 0.1%
Missing3910
Missing (%)20.7%
Memory size147.5 KiB
2025-01-08T18:33:23.321648image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.674511902
Min length4

Characters and Unicode

Total characters69912
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowfalse
2nd rowfalse
3rd rowfalse
4th rowfalse
5th rowfalse
ValueCountFrequency (%)
false 10088
67.5%
true 4868
32.5%
2025-01-08T18:33:23.417936image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 14956
21.4%
f 10088
14.4%
a 10088
14.4%
l 10088
14.4%
s 10088
14.4%
t 4868
 
7.0%
r 4868
 
7.0%
u 4868
 
7.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 69912
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 14956
21.4%
f 10088
14.4%
a 10088
14.4%
l 10088
14.4%
s 10088
14.4%
t 4868
 
7.0%
r 4868
 
7.0%
u 4868
 
7.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 69912
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 14956
21.4%
f 10088
14.4%
a 10088
14.4%
l 10088
14.4%
s 10088
14.4%
t 4868
 
7.0%
r 4868
 
7.0%
u 4868
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 69912
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 14956
21.4%
f 10088
14.4%
a 10088
14.4%
l 10088
14.4%
s 10088
14.4%
t 4868
 
7.0%
r 4868
 
7.0%
u 4868
 
7.0%

isSequenced
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:23.459788image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters94330
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowfalse
2nd rowfalse
3rd rowfalse
4th rowfalse
5th rowfalse
ValueCountFrequency (%)
false 18866
100.0%
2025-01-08T18:33:23.559474image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
f 18866
20.0%
a 18866
20.0%
l 18866
20.0%
s 18866
20.0%
e 18866
20.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 94330
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
f 18866
20.0%
a 18866
20.0%
l 18866
20.0%
s 18866
20.0%
e 18866
20.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 94330
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
f 18866
20.0%
a 18866
20.0%
l 18866
20.0%
s 18866
20.0%
e 18866
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94330
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
f 18866
20.0%
a 18866
20.0%
l 18866
20.0%
s 18866
20.0%
e 18866
20.0%

gbifRegion
Text

Missing 

Distinct7
Distinct (%)< 0.1%
Missing3929
Missing (%)20.8%
Memory size147.5 KiB
2025-01-08T18:33:23.611472image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length11.54395126
Min length4

Characters and Unicode

Total characters172432
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
3rd rowNORTH_AMERICA
4th rowNORTH_AMERICA
5th rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 10775
72.1%
africa 1928
 
12.9%
latin_america 1203
 
8.1%
asia 590
 
3.9%
europe 303
 
2.0%
oceania 136
 
0.9%
antarctica 2
 
< 0.1%
2025-01-08T18:33:23.718939image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 30473
17.7%
R 24986
14.5%
I 15837
9.2%
C 14046
8.1%
E 12720
7.4%
N 12116
 
7.0%
T 11982
 
6.9%
_ 11978
 
6.9%
M 11978
 
6.9%
O 11214
 
6.5%
Other values (6) 15102
8.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 160454
93.1%
Connector Punctuation 11978
 
6.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 30473
19.0%
R 24986
15.6%
I 15837
9.9%
C 14046
8.8%
E 12720
7.9%
N 12116
 
7.6%
T 11982
 
7.5%
M 11978
 
7.5%
O 11214
 
7.0%
H 10775
 
6.7%
Other values (5) 4327
 
2.7%
Connector Punctuation
ValueCountFrequency (%)
_ 11978
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 160454
93.1%
Common 11978
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 30473
19.0%
R 24986
15.6%
I 15837
9.9%
C 14046
8.8%
E 12720
7.9%
N 12116
 
7.6%
T 11982
 
7.5%
M 11978
 
7.5%
O 11214
 
7.0%
H 10775
 
6.7%
Other values (5) 4327
 
2.7%
Common
ValueCountFrequency (%)
_ 11978
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 172432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 30473
17.7%
R 24986
14.5%
I 15837
9.2%
C 14046
8.1%
E 12720
7.4%
N 12116
 
7.0%
T 11982
 
6.9%
_ 11978
 
6.9%
M 11978
 
6.9%
O 11214
 
6.5%
Other values (6) 15102
8.8%

publishedByGbifRegion
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-08T18:33:23.769400image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters245258
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
3rd rowNORTH_AMERICA
4th rowNORTH_AMERICA
5th rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 18866
100.0%
2025-01-08T18:33:23.873115image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
R 37732
15.4%
A 37732
15.4%
N 18866
7.7%
O 18866
7.7%
T 18866
7.7%
H 18866
7.7%
_ 18866
7.7%
M 18866
7.7%
E 18866
7.7%
I 18866
7.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 226392
92.3%
Connector Punctuation 18866
 
7.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 37732
16.7%
A 37732
16.7%
N 18866
8.3%
O 18866
8.3%
T 18866
8.3%
H 18866
8.3%
M 18866
8.3%
E 18866
8.3%
I 18866
8.3%
C 18866
8.3%
Connector Punctuation
ValueCountFrequency (%)
_ 18866
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 226392
92.3%
Common 18866
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 37732
16.7%
A 37732
16.7%
N 18866
8.3%
O 18866
8.3%
T 18866
8.3%
H 18866
8.3%
M 18866
8.3%
E 18866
8.3%
I 18866
8.3%
C 18866
8.3%
Common
ValueCountFrequency (%)
_ 18866
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 245258
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 37732
15.4%
A 37732
15.4%
N 18866
7.7%
O 18866
7.7%
T 18866
7.7%
H 18866
7.7%
_ 18866
7.7%
M 18866
7.7%
E 18866
7.7%
I 18866
7.7%

level0Gid
Text

Missing 

Distinct94
Distinct (%)0.7%
Missing5871
Missing (%)31.1%
Memory size147.5 KiB
2025-01-08T18:33:23.950816image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters38985
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st rowUSA
2nd rowUSA
3rd rowUSA
4th rowUSA
5th rowUSA
ValueCountFrequency (%)
usa 9272
71.4%
can 599
 
4.6%
ken 564
 
4.3%
mex 510
 
3.9%
egy 334
 
2.6%
idn 283
 
2.2%
ecu 207
 
1.6%
grc 118
 
0.9%
cmr 80
 
0.6%
tza 79
 
0.6%
Other values (84) 949
 
7.3%
2025-01-08T18:33:24.079852image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 10201
26.2%
U 9653
24.8%
S 9423
24.2%
N 1727
 
4.4%
E 1709
 
4.4%
C 1189
 
3.0%
M 764
 
2.0%
G 624
 
1.6%
K 592
 
1.5%
X 510
 
1.3%
Other values (16) 2593
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 38985
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 10201
26.2%
U 9653
24.8%
S 9423
24.2%
N 1727
 
4.4%
E 1709
 
4.4%
C 1189
 
3.0%
M 764
 
2.0%
G 624
 
1.6%
K 592
 
1.5%
X 510
 
1.3%
Other values (16) 2593
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 38985
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 10201
26.2%
U 9653
24.8%
S 9423
24.2%
N 1727
 
4.4%
E 1709
 
4.4%
C 1189
 
3.0%
M 764
 
2.0%
G 624
 
1.6%
K 592
 
1.5%
X 510
 
1.3%
Other values (16) 2593
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38985
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 10201
26.2%
U 9653
24.8%
S 9423
24.2%
N 1727
 
4.4%
E 1709
 
4.4%
C 1189
 
3.0%
M 764
 
2.0%
G 624
 
1.6%
K 592
 
1.5%
X 510
 
1.3%
Other values (16) 2593
 
6.7%

level0Name
Text

Missing 

Distinct94
Distinct (%)0.7%
Missing5871
Missing (%)31.1%
Memory size147.5 KiB
2025-01-08T18:33:24.168371image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length13
Mean length11.20176991
Min length4

Characters and Unicode

Total characters145567
Distinct characters54
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st rowUnited States
2nd rowUnited States
3rd rowUnited States
4th rowUnited States
5th rowUnited States
ValueCountFrequency (%)
united 9285
41.2%
states 9272
41.2%
canada 599
 
2.7%
kenya 564
 
2.5%
méxico 510
 
2.3%
egypt 334
 
1.5%
indonesia 283
 
1.3%
ecuador 207
 
0.9%
greece 118
 
0.5%
cameroon 80
 
0.4%
Other values (101) 1268
 
5.6%
2025-01-08T18:33:24.318964image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 28368
19.5%
e 20349
14.0%
a 13650
9.4%
n 11735
8.1%
i 10936
 
7.5%
d 10558
 
7.3%
s 9727
 
6.7%
9525
 
6.5%
S 9390
 
6.5%
U 9288
 
6.4%
Other values (44) 12041
8.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 113606
78.0%
Uppercase Letter 22424
 
15.4%
Space Separator 9525
 
6.5%
Other Punctuation 12
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 28368
25.0%
e 20349
17.9%
a 13650
12.0%
n 11735
10.3%
i 10936
 
9.6%
d 10558
 
9.3%
s 9727
 
8.6%
o 1481
 
1.3%
c 1062
 
0.9%
y 961
 
0.8%
Other values (19) 4779
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
S 9390
41.9%
U 9288
41.4%
C 869
 
3.9%
K 578
 
2.6%
M 562
 
2.5%
E 550
 
2.5%
I 377
 
1.7%
G 165
 
0.7%
T 117
 
0.5%
B 95
 
0.4%
Other values (11) 433
 
1.9%
Other Punctuation
ValueCountFrequency (%)
' 9
75.0%
. 2
 
16.7%
, 1
 
8.3%
Space Separator
ValueCountFrequency (%)
9525
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 136030
93.4%
Common 9537
 
6.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 28368
20.9%
e 20349
15.0%
a 13650
10.0%
n 11735
8.6%
i 10936
 
8.0%
d 10558
 
7.8%
s 9727
 
7.2%
S 9390
 
6.9%
U 9288
 
6.8%
o 1481
 
1.1%
Other values (40) 10548
 
7.8%
Common
ValueCountFrequency (%)
9525
99.9%
' 9
 
0.1%
. 2
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 145034
99.6%
None 533
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 28368
19.6%
e 20349
14.0%
a 13650
9.4%
n 11735
8.1%
i 10936
 
7.5%
d 10558
 
7.3%
s 9727
 
6.7%
9525
 
6.6%
S 9390
 
6.5%
U 9288
 
6.4%
Other values (41) 11508
7.9%
None
ValueCountFrequency (%)
é 510
95.7%
ç 14
 
2.6%
ô 9
 
1.7%

level1Gid
Text

Missing 

Distinct341
Distinct (%)2.6%
Missing5888
Missing (%)31.2%
Memory size147.5 KiB
2025-01-08T18:33:24.502883image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.678301741
Min length7

Characters and Unicode

Total characters99649
Distinct characters38
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)0.8%

Sample

1st rowUSA.7_1
2nd rowUSA.7_1
3rd rowUSA.39_1
4th rowUSA.39_1
5th rowUSA.39_1
ValueCountFrequency (%)
usa.30_1 2868
22.1%
usa.7_1 1293
 
10.0%
usa.24_1 551
 
4.2%
usa.6_1 447
 
3.4%
usa.33_1 446
 
3.4%
usa.3_1 436
 
3.4%
usa.50_1 423
 
3.3%
can.2_1 336
 
2.6%
usa.5_1 290
 
2.2%
idn.23_1 277
 
2.1%
Other values (331) 5611
43.2%
2025-01-08T18:33:24.735491image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 15054
15.1%
. 12978
13.0%
_ 12943
13.0%
A 10197
10.2%
U 9639
9.7%
S 9423
9.5%
3 5867
 
5.9%
0 3709
 
3.7%
2 2826
 
2.8%
7 2156
 
2.2%
Other values (28) 14857
14.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 39039
39.2%
Decimal Number 34689
34.8%
Other Punctuation 12978
 
13.0%
Connector Punctuation 12943
 
13.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 10197
26.1%
U 9639
24.7%
S 9423
24.1%
N 1727
 
4.4%
E 1709
 
4.4%
C 1175
 
3.0%
M 763
 
2.0%
G 659
 
1.7%
K 627
 
1.6%
X 510
 
1.3%
Other values (16) 2610
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 15054
43.4%
3 5867
 
16.9%
0 3709
 
10.7%
2 2826
 
8.1%
7 2156
 
6.2%
4 1816
 
5.2%
5 1294
 
3.7%
6 970
 
2.8%
8 551
 
1.6%
9 446
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 12978
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12943
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 60610
60.8%
Latin 39039
39.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 10197
26.1%
U 9639
24.7%
S 9423
24.1%
N 1727
 
4.4%
E 1709
 
4.4%
C 1175
 
3.0%
M 763
 
2.0%
G 659
 
1.7%
K 627
 
1.6%
X 510
 
1.3%
Other values (16) 2610
 
6.7%
Common
ValueCountFrequency (%)
1 15054
24.8%
. 12978
21.4%
_ 12943
21.4%
3 5867
 
9.7%
0 3709
 
6.1%
2 2826
 
4.7%
7 2156
 
3.6%
4 1816
 
3.0%
5 1294
 
2.1%
6 970
 
1.6%
Other values (2) 997
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99649
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 15054
15.1%
. 12978
13.0%
_ 12943
13.0%
A 10197
10.2%
U 9639
9.7%
S 9423
9.5%
3 5867
 
5.9%
0 3709
 
3.7%
2 2826
 
2.8%
7 2156
 
2.2%
Other values (28) 14857
14.9%

level1Name
Text

Missing 

Distinct339
Distinct (%)2.6%
Missing5888
Missing (%)31.2%
Memory size147.5 KiB
2025-01-08T18:33:24.923289image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length27
Mean length9.578979812
Min length3

Characters and Unicode

Total characters124316
Distinct characters64
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)0.8%

Sample

1st rowConnecticut
2nd rowConnecticut
3rd rowPennsylvania
4th rowPennsylvania
5th rowPennsylvania
ValueCountFrequency (%)
new 3505
19.9%
hampshire 2868
16.3%
connecticut 1293
 
7.3%
minnesota 551
 
3.1%
colorado 447
 
2.5%
york 446
 
2.5%
arizona 436
 
2.5%
wisconsin 423
 
2.4%
british 336
 
1.9%
columbia 336
 
1.9%
Other values (386) 6996
39.7%
2025-01-08T18:33:25.177916image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 12131
 
9.8%
i 10970
 
8.8%
e 10741
 
8.6%
n 8249
 
6.6%
o 8034
 
6.5%
s 7338
 
5.9%
r 7053
 
5.7%
t 5362
 
4.3%
4659
 
3.7%
h 4513
 
3.6%
Other values (54) 45266
36.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 101897
82.0%
Uppercase Letter 17647
 
14.2%
Space Separator 4659
 
3.7%
Dash Punctuation 102
 
0.1%
Other Punctuation 11
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 12131
11.9%
i 10970
10.8%
e 10741
10.5%
n 8249
 
8.1%
o 8034
 
7.9%
s 7338
 
7.2%
r 7053
 
6.9%
t 5362
 
5.3%
h 4513
 
4.4%
w 3956
 
3.9%
Other values (24) 23550
23.1%
Uppercase Letter
ValueCountFrequency (%)
N 4225
23.9%
H 2947
16.7%
C 2656
15.1%
M 1539
 
8.7%
A 1203
 
6.8%
W 776
 
4.4%
P 539
 
3.1%
Y 450
 
2.6%
T 407
 
2.3%
B 400
 
2.3%
Other values (16) 2505
14.2%
Other Punctuation
ValueCountFrequency (%)
, 6
54.5%
' 5
45.5%
Space Separator
ValueCountFrequency (%)
4659
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 119544
96.2%
Common 4772
 
3.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 12131
 
10.1%
i 10970
 
9.2%
e 10741
 
9.0%
n 8249
 
6.9%
o 8034
 
6.7%
s 7338
 
6.1%
r 7053
 
5.9%
t 5362
 
4.5%
h 4513
 
3.8%
N 4225
 
3.5%
Other values (50) 40928
34.2%
Common
ValueCountFrequency (%)
4659
97.6%
- 102
 
2.1%
, 6
 
0.1%
' 5
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 124049
99.8%
None 267
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 12131
 
9.8%
i 10970
 
8.8%
e 10741
 
8.7%
n 8249
 
6.6%
o 8034
 
6.5%
s 7338
 
5.9%
r 7053
 
5.7%
t 5362
 
4.3%
4659
 
3.8%
h 4513
 
3.6%
Other values (46) 44999
36.3%
None
ValueCountFrequency (%)
á 122
45.7%
é 69
25.8%
ó 37
 
13.9%
í 24
 
9.0%
ô 10
 
3.7%
ý 3
 
1.1%
ö 1
 
0.4%
š 1
 
0.4%

level2Gid
Text

Missing 

Distinct973
Distinct (%)7.5%
Missing5935
Missing (%)31.5%
Memory size147.5 KiB
2025-01-08T18:33:25.371814image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length12
Median length10
Mean length10.09287758
Min length7

Characters and Unicode

Total characters130511
Distinct characters38
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique358 ?
Unique (%)2.8%

Sample

1st rowUSA.7.5_1
2nd rowUSA.7.5_1
3rd rowUSA.39.9_1
4th rowUSA.39.51_1
5th rowUSA.39.9_1
ValueCountFrequency (%)
usa.30.2_1 2600
 
20.1%
usa.7.5_1 626
 
4.8%
usa.24.11_1 354
 
2.7%
usa.7.3_1 328
 
2.5%
usa.6.27_1 268
 
2.1%
idn.23.5_1 244
 
1.9%
usa.30.4_1 164
 
1.3%
egy.17.9_1 162
 
1.3%
usa.50.26_1 162
 
1.3%
usa.7.1_1 144
 
1.1%
Other values (963) 7879
60.9%
2025-01-08T18:33:25.630287image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 25827
19.8%
1 18015
13.8%
_ 12931
9.9%
A 10186
 
7.8%
U 9638
 
7.4%
S 9412
 
7.2%
2 8521
 
6.5%
3 7867
 
6.0%
0 4300
 
3.3%
5 3455
 
2.6%
Other values (28) 20359
15.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 52960
40.6%
Uppercase Letter 38793
29.7%
Other Punctuation 25827
19.8%
Connector Punctuation 12931
 
9.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 10186
26.3%
U 9638
24.8%
S 9412
24.3%
E 1706
 
4.4%
N 1692
 
4.4%
C 1133
 
2.9%
M 752
 
1.9%
G 632
 
1.6%
K 627
 
1.6%
X 510
 
1.3%
Other values (16) 2505
 
6.5%
Decimal Number
ValueCountFrequency (%)
1 18015
34.0%
2 8521
16.1%
3 7867
14.9%
0 4300
 
8.1%
5 3455
 
6.5%
4 3300
 
6.2%
7 3051
 
5.8%
6 2166
 
4.1%
9 1295
 
2.4%
8 990
 
1.9%
Other Punctuation
ValueCountFrequency (%)
. 25827
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12931
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 91718
70.3%
Latin 38793
29.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 10186
26.3%
U 9638
24.8%
S 9412
24.3%
E 1706
 
4.4%
N 1692
 
4.4%
C 1133
 
2.9%
M 752
 
1.9%
G 632
 
1.6%
K 627
 
1.6%
X 510
 
1.3%
Other values (16) 2505
 
6.5%
Common
ValueCountFrequency (%)
. 25827
28.2%
1 18015
19.6%
_ 12931
14.1%
2 8521
 
9.3%
3 7867
 
8.6%
0 4300
 
4.7%
5 3455
 
3.8%
4 3300
 
3.6%
7 3051
 
3.3%
6 2166
 
2.4%
Other values (2) 2285
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 130511
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 25827
19.8%
1 18015
13.8%
_ 12931
9.9%
A 10186
 
7.8%
U 9638
 
7.4%
S 9412
 
7.2%
2 8521
 
6.5%
3 7867
 
6.0%
0 4300
 
3.3%
5 3455
 
2.6%
Other values (28) 20359
15.6%

level2Name
Text

Missing 

Distinct895
Distinct (%)6.9%
Missing5935
Missing (%)31.5%
Memory size147.5 KiB
2025-01-08T18:33:25.819066image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length31
Median length27
Mean length8.232000619
Min length3

Characters and Unicode

Total characters106448
Distinct characters86
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique307 ?
Unique (%)2.4%

Sample

1st rowNew Haven
2nd rowNew Haven
3rd rowBucks
4th rowPhiladelphia
5th rowBucks
ValueCountFrequency (%)
carroll 2600
 
15.8%
new 738
 
4.5%
haven 626
 
3.8%
cass 356
 
2.2%
litchfield 328
 
2.0%
gunnison 268
 
1.6%
dogiyai 244
 
1.5%
north 204
 
1.2%
aswan 175
 
1.1%
no 166
 
1.0%
Other values (988) 10746
65.3%
2025-01-08T18:33:26.068762image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 11555
 
10.9%
r 9803
 
9.2%
o 9094
 
8.5%
l 8669
 
8.1%
e 7675
 
7.2%
n 6688
 
6.3%
i 6359
 
6.0%
s 4313
 
4.1%
C 3874
 
3.6%
3520
 
3.3%
Other values (76) 34898
32.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 85822
80.6%
Uppercase Letter 16283
 
15.3%
Space Separator 3520
 
3.3%
Decimal Number 312
 
0.3%
Dash Punctuation 303
 
0.3%
Other Punctuation 208
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 11555
13.5%
r 9803
11.4%
o 9094
10.6%
l 8669
10.1%
e 7675
8.9%
n 6688
7.8%
i 6359
 
7.4%
s 4313
 
5.0%
t 3151
 
3.7%
u 2285
 
2.7%
Other values (31) 16230
18.9%
Uppercase Letter
ValueCountFrequency (%)
C 3874
23.8%
N 1417
 
8.7%
H 1012
 
6.2%
S 996
 
6.1%
L 991
 
6.1%
M 913
 
5.6%
G 633
 
3.9%
A 630
 
3.9%
F 619
 
3.8%
B 617
 
3.8%
Other values (20) 4581
28.1%
Decimal Number
ValueCountFrequency (%)
1 144
46.2%
5 94
30.1%
2 35
 
11.2%
9 19
 
6.1%
3 7
 
2.2%
7 5
 
1.6%
8 3
 
1.0%
4 3
 
1.0%
6 1
 
0.3%
0 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
. 171
82.2%
' 36
 
17.3%
/ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
3520
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 303
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 102105
95.9%
Common 4343
 
4.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 11555
 
11.3%
r 9803
 
9.6%
o 9094
 
8.9%
l 8669
 
8.5%
e 7675
 
7.5%
n 6688
 
6.6%
i 6359
 
6.2%
s 4313
 
4.2%
C 3874
 
3.8%
t 3151
 
3.1%
Other values (61) 30924
30.3%
Common
ValueCountFrequency (%)
3520
81.0%
- 303
 
7.0%
. 171
 
3.9%
1 144
 
3.3%
5 94
 
2.2%
' 36
 
0.8%
2 35
 
0.8%
9 19
 
0.4%
3 7
 
0.2%
7 5
 
0.1%
Other values (5) 9
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106155
99.7%
None 292
 
0.3%
IPA Ext 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 11555
 
10.9%
r 9803
 
9.2%
o 9094
 
8.6%
l 8669
 
8.2%
e 7675
 
7.2%
n 6688
 
6.3%
i 6359
 
6.0%
s 4313
 
4.1%
C 3874
 
3.6%
3520
 
3.3%
Other values (56) 34605
32.6%
None
ValueCountFrequency (%)
é 92
31.5%
á 52
17.8%
í 37
12.7%
ú 30
 
10.3%
ñ 28
 
9.6%
ô 11
 
3.8%
ó 11
 
3.8%
ı 6
 
2.1%
ö 6
 
2.1%
ü 6
 
2.1%
Other values (9) 13
 
4.5%
IPA Ext
ValueCountFrequency (%)
ə 1
100.0%

level3Gid
Text

Missing 

Distinct353
Distinct (%)15.2%
Missing16539
Missing (%)87.7%
Memory size147.5 KiB
2025-01-08T18:33:26.229127image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.96648045
Min length11

Characters and Unicode

Total characters27846
Distinct characters35
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique157 ?
Unique (%)6.7%

Sample

1st rowKEN.14.3.2_1
2nd rowKEN.14.3.2_1
3rd rowKHM.14.1.4_2
4th rowKEN.36.1.2_1
5th rowIDN.23.5.4_1
ValueCountFrequency (%)
idn.23.5.4_1 244
 
10.5%
ken.14.3.2_1 90
 
3.9%
ken.33.6.4_1 82
 
3.5%
can.2.13.1_1 77
 
3.3%
ecu.16.5.7_1 76
 
3.3%
ken.36.1.2_1 57
 
2.4%
can.1.6.6_1 53
 
2.3%
ken.33.5.1_1 51
 
2.2%
ecu.16.5.5_1 50
 
2.1%
can.2.4.38_1 47
 
2.0%
Other values (343) 1500
64.5%
2025-01-08T18:33:26.451131image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 6981
25.1%
1 4347
15.6%
_ 2327
 
8.4%
2 1710
 
6.1%
3 1593
 
5.7%
N 1578
 
5.7%
C 1059
 
3.8%
4 1020
 
3.7%
5 1001
 
3.6%
E 852
 
3.1%
Other values (25) 5378
19.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11557
41.5%
Other Punctuation 6981
25.1%
Uppercase Letter 6981
25.1%
Connector Punctuation 2327
 
8.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 1578
22.6%
C 1059
15.2%
E 852
12.2%
A 734
10.5%
K 583
 
8.4%
D 419
 
6.0%
I 351
 
5.0%
R 282
 
4.0%
G 225
 
3.2%
U 216
 
3.1%
Other values (13) 682
9.8%
Decimal Number
ValueCountFrequency (%)
1 4347
37.6%
2 1710
 
14.8%
3 1593
 
13.8%
4 1020
 
8.8%
5 1001
 
8.7%
6 780
 
6.7%
8 333
 
2.9%
7 312
 
2.7%
9 274
 
2.4%
0 187
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 6981
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2327
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20865
74.9%
Latin 6981
 
25.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 1578
22.6%
C 1059
15.2%
E 852
12.2%
A 734
10.5%
K 583
 
8.4%
D 419
 
6.0%
I 351
 
5.0%
R 282
 
4.0%
G 225
 
3.2%
U 216
 
3.1%
Other values (13) 682
9.8%
Common
ValueCountFrequency (%)
. 6981
33.5%
1 4347
20.8%
_ 2327
 
11.2%
2 1710
 
8.2%
3 1593
 
7.6%
4 1020
 
4.9%
5 1001
 
4.8%
6 780
 
3.7%
8 333
 
1.6%
7 312
 
1.5%
Other values (2) 461
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27846
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 6981
25.1%
1 4347
15.6%
_ 2327
 
8.4%
2 1710
 
6.1%
3 1593
 
5.7%
N 1578
 
5.7%
C 1059
 
3.8%
4 1020
 
3.7%
5 1001
 
3.6%
E 852
 
3.1%
Other values (25) 5378
19.3%

level3Name
Text

Missing 

Distinct349
Distinct (%)15.0%
Missing16544
Missing (%)87.7%
Memory size147.5 KiB
2025-01-08T18:33:26.644603image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length26
Mean length10.30275624
Min length3

Characters and Unicode

Total characters23923
Distinct characters77
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)6.6%

Sample

1st rowKibarani
2nd rowKibarani
3rd rowSrae Khtum
4th rowGatarakwa
5th rowKamu Utara
ValueCountFrequency (%)
utara 245
 
6.6%
kamu 244
 
6.5%
no 93
 
2.5%
kibarani 90
 
2.4%
siana 82
 
2.2%
abbotsford 77
 
2.1%
talag 76
 
2.0%
kootenay 63
 
1.7%
east 63
 
1.7%
kumba 61
 
1.6%
Other values (431) 2644
70.7%
2025-01-08T18:33:26.903087image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3597
15.0%
r 1603
 
6.7%
i 1549
 
6.5%
o 1491
 
6.2%
t 1439
 
6.0%
1416
 
5.9%
n 1275
 
5.3%
e 1080
 
4.5%
u 868
 
3.6%
m 819
 
3.4%
Other values (67) 8786
36.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 18215
76.1%
Uppercase Letter 3591
 
15.0%
Space Separator 1416
 
5.9%
Other Punctuation 267
 
1.1%
Decimal Number 256
 
1.1%
Dash Punctuation 72
 
0.3%
Open Punctuation 54
 
0.2%
Close Punctuation 52
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 3597
19.7%
r 1603
8.8%
i 1549
8.5%
o 1491
 
8.2%
t 1439
 
7.9%
n 1275
 
7.0%
e 1080
 
5.9%
u 868
 
4.8%
m 819
 
4.5%
s 643
 
3.5%
Other values (21) 3851
21.1%
Uppercase Letter
ValueCountFrequency (%)
K 653
18.2%
U 292
 
8.1%
M 258
 
7.2%
N 256
 
7.1%
S 248
 
6.9%
C 245
 
6.8%
A 224
 
6.2%
G 177
 
4.9%
D 152
 
4.2%
I 139
 
3.9%
Other values (18) 947
26.4%
Decimal Number
ValueCountFrequency (%)
9 64
25.0%
1 63
24.6%
3 59
23.0%
2 34
13.3%
5 17
 
6.6%
7 8
 
3.1%
6 3
 
1.2%
8 3
 
1.2%
4 3
 
1.2%
0 2
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 148
55.4%
, 56
 
21.0%
/ 46
 
17.2%
' 17
 
6.4%
Space Separator
ValueCountFrequency (%)
1416
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 21806
91.2%
Common 2117
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 3597
16.5%
r 1603
 
7.4%
i 1549
 
7.1%
o 1491
 
6.8%
t 1439
 
6.6%
n 1275
 
5.8%
e 1080
 
5.0%
u 868
 
4.0%
m 819
 
3.8%
K 653
 
3.0%
Other values (49) 7432
34.1%
Common
ValueCountFrequency (%)
1416
66.9%
. 148
 
7.0%
- 72
 
3.4%
9 64
 
3.0%
1 63
 
3.0%
3 59
 
2.8%
, 56
 
2.6%
( 54
 
2.6%
) 52
 
2.5%
/ 46
 
2.2%
Other values (8) 87
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 23866
99.8%
None 57
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 3597
15.1%
r 1603
 
6.7%
i 1549
 
6.5%
o 1491
 
6.2%
t 1439
 
6.0%
1416
 
5.9%
n 1275
 
5.3%
e 1080
 
4.5%
u 868
 
3.6%
m 819
 
3.4%
Other values (60) 8729
36.6%
None
ValueCountFrequency (%)
é 25
43.9%
ñ 24
42.1%
í 4
 
7.0%
Î 1
 
1.8%
Ł 1
 
1.8%
ń 1
 
1.8%
ó 1
 
1.8%

iucnRedListCategory
Text

Missing 

Distinct8
Distinct (%)0.1%
Missing7581
Missing (%)40.2%
Memory size147.5 KiB
2025-01-08T18:33:26.959599image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters22570
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLC
2nd rowLC
3rd rowLC
4th rowLC
5th rowLC
ValueCountFrequency (%)
lc 6500
57.6%
ne 3477
30.8%
vu 401
 
3.6%
en 356
 
3.2%
nt 314
 
2.8%
cr 147
 
1.3%
dd 85
 
0.8%
ex 5
 
< 0.1%
2025-01-08T18:33:27.051181image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 6647
29.5%
L 6500
28.8%
N 4147
18.4%
E 3838
17.0%
V 401
 
1.8%
U 401
 
1.8%
T 314
 
1.4%
D 170
 
0.8%
R 147
 
0.7%
X 5
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 22570
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 6647
29.5%
L 6500
28.8%
N 4147
18.4%
E 3838
17.0%
V 401
 
1.8%
U 401
 
1.8%
T 314
 
1.4%
D 170
 
0.8%
R 147
 
0.7%
X 5
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 22570
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 6647
29.5%
L 6500
28.8%
N 4147
18.4%
E 3838
17.0%
V 401
 
1.8%
U 401
 
1.8%
T 314
 
1.4%
D 170
 
0.8%
R 147
 
0.7%
X 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22570
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 6647
29.5%
L 6500
28.8%
N 4147
18.4%
E 3838
17.0%
V 401
 
1.8%
U 401
 
1.8%
T 314
 
1.4%
D 170
 
0.8%
R 147
 
0.7%
X 5
 
< 0.1%